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pression regulation sequence, a secretory DNA sequence encoding a secretory signal which is functional in mammary secretory 
cells of the bovine species and a recombinant DNA sequence encoding the recombinant polypeptides. A method for producing 
transgenic bovine species comprises introducing the above transgene into an embryonal target cell of a bovine species, transplan- 
ting the transgenic embryonic target cell formed thereby into a recipient bovine parent and identifying at least one female off- 
spring which is capable of producing the recombinant polypeptide in its milk. The invention also includes transgenic bovine spe- 
cies capable of producing recombinant polypeptides in transgenic milk as well as the milk from such transgenic bovine species 
and food formulations containing one or more recombinant polypeptide. A method for producing transgenic non-human mam- 
mals having a desirable phenotype comprises first methylating a transgene followed by introduction into fertilized oocytes. The 
oocytes are then cultured to form preimplantation embryos. Thereafter, at least one cell is removed from each of the pre-implan- 
tation embryos and the DNA digested with a restriction endonuclease capable of cleaving the methylated transgene but incapable 
of cleaving the unmethylated form of the transgene. 



FOR TUB PURPOSES OP INFORMATION ONLY 



Codes used to identify Stales party to the PCT on the front pages of pamphlets publishing international 
applications under the PCT. 



AT 


Austria 




PI 


Finland 


ML 


Mali 


AU 


Australia 




PR 


France 


MN 


Mongolia 


aa 


Barbados 




CA 


Gabon 


MR 


Mauritania 




Belgium 




CB 


United Kingdom 


MW 


Malawi 


BP 


Burkina Faso 




CN 


Guinea 


NL 


Netherlands 


ac 


Bulgaria 




CR 


Greece 


NO 


Norway 


aj 


Benin 




HU 


Hungary 


PL 


Poland 


BR 


Brazil 




IT 


Italy 


RO 


Romania 


CA 


Canada 




Jp 


Japan 


so 


Sudan 


CP 


Central African Rq 


public 


HP 


Democratic People** Republic 


SB 


Sweden 


CC 


Congo 






of Korea 


SN 


Senegal 


CH 


SwHzerland 




KR 


Republic of Korea 


SU 


Soviet Union 


a 


Cottdlvoire 




U 


Liechtenstein 


TO 


Chad 


CM 


Cameroon 




Ut 


Sri Lanka 


TO 


Togo 


DK 


Gcmany 




LU 


Luxembourg 


US 


United States of America 


DK 


Denmark 




MC 


Monaco 






es 


Spain 




MC 


Madagascar 







WO 91/08216 PCT/US90/06874 



1 



PRODUCTION OP RECOMBINANT POLYPEPTIDES 
BY BOVINE SPECIES AND TRANSGENIC METHODS 

Field of the Invention 

The invention relates to the production of recombinant 
polypeptides by transgenic bovine species and to methods 
for producing transgenic non-human mammals having a 
5 desired phenotype. 

Background of the Invention 

There is a plethora of literature relating to the 
expression of heterologous genes in lower organisms such 
as unicellular bacteria, yeast and filamentous fungi, 

10 and in higher cell types such as mammalian cells. There 
are also numerous reports on the production of 
transgenic animals, most of which relate to the 
production of transgenic mice. See, e.g. U.S. Pat. No. 
4,736,866 (transgenic mice containing activated 

15 oncogene); Andres, A., et al. (1987) Ftps, Pafrl, ft<?frfl f 
Sci. USA 84 . 1299-1303 (HA-RAS oncogene under control 
of whey acid protein promoter) ; Schoenberger, C.A. , et 
al. (1987) Exnerientia A2, 644 and (1988) J- 7. 

169-175 (C-myc oncogene under control of whey acid 

20 protein promoter) ; and Muller, W.J., et al. (1988) QeU 
54 , 105-115 (C-myc oncogene under control of the mouse 
mammary tumor virus promoter) . Several laboratories 
have also reported the production of transgenic porcine 
species (Miller, K.F., et al. (1989) , J t EnflPCrintt 12£, 

25 481-488 (expression of human or bovine growth hormone 
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gene in transgenic swine); Vize, P.O., et al. (1988), 
J T Cell Sci. f 9Q, 295-300 (porcine growth hormone fusion 
gene in transgenic pigs); and Ebert, K. et al. (1988), 
Mol. Endocrin. . £, 277-283 (MMLV-rat somatotropin fusion 
5 gene in transgenic pigs) ) , transgenic sheep (Nancarrow, 

et al. (1987) , Ther ioaenolocry . 21$ 263 (transgenic sheep ? 
containing bovine growth hormone gene) Clark, A.J. et 
al. (1989) Bio /Technology 7 r 487-482 and Simons, J., et 
al. (1988) Pio/TeghnQloqY &, 179-183 (human factor IX 

10 and ct-1 antitrypsin CONA in ovine species) , and rabbit 
(Hanover, S.V., et al. (1987), Pggtchq fieramUPhQ 
Wochenschr if t . 94 , 476-478 (production of transgenic 
rabbits by injection of uteroglobin-promoter-CAT fusion 
gene into fertilized rabbit oocytes) . A number of 

15 reports have also suggested the production of transgenic 
cattle (Wagner, et al. (1984) , TherioaenoloaY . 21 , 29- 
44) with one reporting some progress in microinjection 
techniques (Lohse, J.K. , et al. (1985), Ther iocrenology . 
22, 205) • However, little, if any, success has been 

20 achieved in producing transgenic cows. Scientific 
articles which clearly demonstrate the actual production 
of a transgenic cow capable of producing a heterologous 
protein are presently unknown. This, despite the 
statements that one transgenic cow was produced in 

25 Canada which expressed human ^-interferon (Van Brunt, 
J. (1988), pio/TgghnQlogY, fi, 1149-1155) and that 
transient expression of human a-f etoprotein in liver and 
blood was obtained on one occasion (Church, R.B* (1986) , 
Biotechnology News Watch . 6 (15) , 4) . One reference 

30 reports that bovine papilloma virus was apparently 
integrated but not expressed in a transgenic cow 
(Roschlau et al. (1988) Arch. Tierz.. Berlin 31, 3-8) . 
A recent article has summarized the genetic engineering * 
of livestock. (Pursel, V.G. et al. (1989) , Science . 

35 2Mr 1281-1288) . 
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A number of laboratories have reported tissue-specific 
expression of DNA encoding various proteins in the 
mammary gland or the production of various proteins in 
the milk of transgenic mice and sheep. For example, 
5 Simmons, J.P., et al. (1987) Nature 328. 530-532 report 
the microinjection of a I6.2kb genomic fragment encoding 
0-lactoglobulin (BLG) including 4kb of 5 1 sequence, 
4.9kb of the BLG transcription unit and 7.3kb of 3 1 
flanking sequence into fertilized mouse eggs. According 

10 to these authors, the sheep BLG was expressed in mammary 
tissue and produced BLG in the milk of the transgenic 
mice at concentrations ranging from about 3.0 to about 
23 mg/ml. When, however, cDNA encoding human factor IX 
or human ^l-^pti trypsin was inserted into the 5 1 

15 untranslated region of the BLG gene and microinjected 
into sheep (Simmons, J.P., et al. (1988) Bi9 /Technology 
£, 179-183) the production of factor IX or al- 
antitrypsin was significantly reduced (25ng/ml for 
factor IX and lOmg/ml tot al-antitrypsin; see Clark, 

20 A.J., et al. (1989) Bio /Technology 7. 487-492). 

In a similar approach, a 14kb genomic clone containing 
the entire 7.5kb rat 0-casein together with 3.5kb of 5 1 
and 3.0kb of 3 1 flanking DNA was reportedly 
microinjected into fertilized mouse oocytes. Lee, et 
25 al. (1988) Nucl- Acids Res. 16 1027-1041. Yet, in this 
case, the level of expression of the rat /3-transgene in 
the lactating mammary gland of transgenic mice was 
reported to be at a level of 0.01-1% of the endogenous 
mouse 0-casein gene. 

30 Human tissue plasminogen activator (t-PA) reportedly was 
produced in transgenic mouse milk at the levels between 
0.2 and about 0.4pg/ml when a cDNA encoding a human t-PA 
with its endogenous secretion sequence was expressed 
under control of a 2.6kb 5 1 sequence of the murine whey 

35 acid protein gene. Gordon, K. , et al. (1987) 
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Bio /Technology 5, 1183-1187. Subsequent experiments 
using the same or similar construction reportedly 
produced t-PA in different mouse lines arranging from 
less than 20ng of t-PA per ml of milk to about 50/xg/ml. 
5 Pittius, C.W., et al. (1988) Proc. Natl. Acad, Sci. USA 
35/ 5874-5878. 

U.S. Patent No. 4,873,316 issued October 10, 1989, 
discloses the use of 9kb of 5 V sequence from the bovine 
aSl-casein gene including the casein signal peptide and 
10 several casein codons fused to a mature t-PA sequence. 
The transgenic mice obtained with this construct 
reportedly produced about 0.2-0.5/ig/ml of a t-PA fusion 
protein in their milk. 

In addition, a number of patent publications purportedly 

15 describe the production of specific proteins in the milk 
of transgenic mice and sheep. See, e.g. European Patent 
Publication No. 0 264 166 published April 20, 1988 
(hepatitis B surface antigen and t-PA genes under 
control of the whey acid promoter protein for mammary 

20 tissue specific expression in mice) ; PCT Publication No. 
W088/00239 published January 14, 1988 (tissue specific 
expression of a transgene encoding factor IX under 
control of a whey protein promoter in sheep) ; PCT 
Publication No. W088/ 01648 published March 10, 1988 

25 (transgenic mouse having mammary secretory cells 
incorporating a recombinant expression system comprising 
a bovine ot-lactalbumin gene fused to inter leukin-2) ; 
European Pat. Pub. No. 0 279 582 published August 24, 
1988 (tissue-specific expression of chloramphenicol 

30 acetyl transferase under control of rat 0-casein promoter 
in transgenic mice) ; and PCT Pub. No. W088/ 10118 
published December 29, 1988 (transgenic mice and sheep 
containing transgene encoding bovine aSl-casein promoter 
and signal sequence fused to t-PA) . 
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Given the state of the transgenic art, it is apparent 
that a need exists for methods which enable the 
efficient production of transgenic mammals, especially 
transgenic mammals other than transgenic mice. 

5 Further, it is apparent that a need exists for methods 
for producing transgenic bovine species which are 
capable of producing recombinant polypeptides such as 
human milk proteins and human serum proteins in the milk 
of such transgenic mammals. 

10 Accordingly, it is an object herein to provide methods 
for detecting the transgenesis of fertilized oocytes 
prior to implantation. 

In addition, it is an object herein to provide 
transgenic bovine species which are capable of producing 
15 recombinant polypeptides which are maintained 
intracellular ly or are secreted extracellular ly. 

It is also an object herein to provide transgenic bovine 
species which are capable of producing recombinant 
polypeptides such as human milk proteins and human serum 
20 proteins in the milk of such transgenic animals. 

Further, it is an object herein to provide milk from a 
transgenic bovine species containing such recombinant 
polypeptides • 

Still further, it is em object herein to provide food 
25 formulations supplemented with recombinant polypeptides 
from such transgenic milk such as human infant formula 
supplemented with human lactoferrin. 



Further, it is an object herein to provide transgenes 
which are capable of directing the production of 



recombinant polypeptides in the milk of transgenic 
bovine species* 

The references discussed herein are provided solely for 
their disclosure prior to the filing date of the present 
5 application. Nothing herein is to be construed as an 
admission that the inventors are not entitled to 
antedate such disclosure by priority based on earlier 
filed applications. 

f^W^TT nfr fM Invention 

10 In accordance with the above objects, the invention 
includes transgenes for producing recombinant 
polypeptides in the milk of transgenic bovine species. 
The production of such transgenic bovine milk containing 
one or more recombinant polypeptides is desirable since 

15 it provides a matrix wherein little or no purification 
is necessary for human consumption. The transgene 
comprises a secretory DNA sequence encoding a secretory 
signal sequence which is functional in mammary secretory 
cells of the bovine species of interest and a 

20 recombinant DNA sequence encoding the recombinant 
polypeptide. These sequences are operably linked to 
form a secretory-recombinant DNA sequence. At least one 
expression regulation sequence, functional in the 
mammary secretory cells of the bovine species, is 

25 operably linked to the secretory-recombinant DNA 
sequence. The transgene so constructed is capable of 
directing the expression of the secretory-recombinant 
DNA sequence in mammary secretory cells of bovine 
species containing the transgene. Such expression 

30 produces a form of recombinant polypeptide which is 
secreted from the mammary secretory cells into the milk 
of the transgenic bovine species. 

In addition, the invention includes methods for 
producing such transgenic bovine species. The method 
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includes introducing the above transgene into an 
embryonal target cell of a bovine species, transplanting 
the transgenic embryonic target cell formed thereby into 
a recipient bovine parent and identifying at least one 
5 female offspring which is capable of producing the 
recombinant polypeptide in its milk. 

The invention also includes transgenic bovine species 
capable of producing recombinant polypeptides in the 
milk of lactating females of said species, the milk from 
10 such transgenic bovine species containing such 
recombinant polypeptides and food formulations 
containing the transgenic milk in liquid or dried form, 
as well as food formulations supplemented with one or 
more recombinant polypeptides from such transgenic milk. 

15 In addition to the foregoing, the invention includes 
transgenes and transgenic bovine species containing 
transgenes that are capable of producing a recombinant 
polypeptide. Such transgenes are similar to the 
aforementioned transgenes for milk secretion and are 

20 characterized by having an expression regulation 
sequence which targets the expression of the DNA 
encoding the recombinant polypeptide to a particular 
cell or tissue type, e.g. expression of human serum 
albumin in the liver of a transgenic bovine species. 

25 When the recombinant polypeptide is to be secreted from 
such targeted cells or tissues, a secretory DNA sequence 
encoding a secretory signal sequence functional in the 
particular targeted cell or tissue is operably linked 
to the recombinant DNA sequence encoding the recombinant 

30 polypeptide, e.g. secretion of human serum albumin from 
bovine liver into the bovine circulatory system. 

Further, the invention includes methods for producing 
transgenic non-human mammals having a desirable 
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phenotype. The method comprises first causing the 
methylation of a transgene capable of conferring the 
desirable phenotype when incorporated into the cells of 
a transgenic non-human animal, e.g. , by transforming an 
5 appropriate bacterium, such as £. coll MM 294, with a 
plasmid containing the transgene. The methylated 
transgene is then excised and introduced into fertilized 
oocytes of the non-human animal to permit integration 
into the genome. The oocytes are then cultured to form 

10 pre-implantation embryos thereby replicating the genome 
of each of the fertilized oocytes. Thereafter, at least 
one cell is removed from each of the pre-implantation 
embryos and treated to release the DNA contained 
therein. Each of the releasee^. are then digested 

15 with a restriction endonuclease capable of cleaving the 
methylated transgene but incapable of cleaving the 
unmethylated form of the transgene formed after 
integration into and replication of the genomic DNA. 
Those pre-implantation embtyos which have integrated the 

20 transgene contain DNA which is resistant to cleavage by 
the restriction endonuclease in the region containing 
the transgene. This resistance to digestion, which can 
be detected by electrophoresis of the digest after PGR 
amplification of the DNA and hybridization with a 

25 labelled probe for the transgene, facilitates the 
identification of successful transgenesis. 

The invention also includes a method to produce a 
population of transgenic of f spring having the same 
genotype. This method utilizes a specific embodiment 

30 of the above method for detecting early transgenesis. 
In this method, a methylated transgene is introduced 
into fertilized oocytes which are cultured to pre- 
implantation embryos. Thereafter, each pre-implantation 
embryo is divided to form first and second hemi-embryos . 

35 Each of the first hemi-embryos are then analyzed for 
transgenesis as described above. After identifying 
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successful transgenesis in at least one first 
hemi-embryo , the second untreated hemi-embryo which 
contains the integrated transgene, is cloned to form a 
multiplicity of clonal transgenic blastocysts or hemi- 
5 blastocysts, each of which have the same genotype. The 
transgenic embryos are thereafter transplanted into one 
or more recipient female parents to produce a population 
of transgenic non-human mammals having the same 
genotype* 

10 Brief Desc ription of the Drawings 

The accompanying drawings, which are incorporated in and 
form a part of the specification, illustrate embodiments 
of the present invention - and, together with the 
description, serve to explain the principles of the 

15 invention. In the drawings: 

Fig. 1 depicts the DNA (Seq. ID No.: 1) and amino acid 
(Seq. ID No. : 2) sequence for a human lactoferrin clone 
derived from a human mammary cDNA library as described 
herein except that the sequence between nucleotides 
20 1557-1791 and 2050-2119 corresponds to the previously 
published sequence (Rado et al. (1987) Blood 2£, 989- 
993) . 

Fig. 2 depicts the complete DNA (Seq. ID No. : 3) and 
amino acid (Seq. ID No.: 4) sequence of human 
25 lactoferrin including 5* and 3" untranslated sequence 
as well as the complete human lactoferrin signal 
sequence. 

Fig. 3 is a restriction map of a clone of a 5 1 -flanking 
region of bovine aSl-casein gene. 

30 Fig. 4 is a restriction map of a clone of a 3 •-flanking 
region of bovine aSl-casein gene. 



-10- 

Figs. 5A f 5B and 5C depict the construction of 
pSI3'5 l C&T and pSIS'CAT. 

Fig. 6 depicts pMH-1. 

Figs. 7A through 7F depict the construction of 
5 expression vectors containing sequences encoding human 
lactoferrin. 

Fig* 8 depicts the genome of human serum albumin, the 
fragments used to generate transgenic mice contained in 
this genomic DNA and the identification of the fragment 
10 sizes which would be obtained upon the digestion of 
genomic DNA from a transgenic mouse with the restriction 
enzymes BstE-II and Nco-I or with Nco-I and Hindi-Ill. 

Fig. 9 depicts an alternate pathway for the construction 
of a transgene of the invention encoding human 
15 lactoferin. 

Fig. 10 depicts the construction of a plasmid pPC 
containing a transgene encoding Protein C. 

Fig . 11 depicts the DNA sequence for a hybrid 
intervening sequence used in a preferred embodiment of 
20 the invention. This hybrid sequence comprises a 5' 
portion from an intervening sequence of bovine 
aSl-casein and a 3 • portion from an intervening sequence 
of an IgG intervening sequence. The juncture of the 5 1 
and 3' portion is the Hindlll site shown. 

25 Detailed Description of the Invention 

The H non-human mammals' 1 of the invention comprise all 
non-human mammals capable of producing a "transgenic 
non-human mammal" having a "desirable phenotype" . Such 
mammals include non-human primates, murine species, 

30 bovine species, canine species, etc* Preferred non- 
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human animals include bovine, porcine and ovine species, 
most preferably bovine species* 

Desirable phenotypes for transgenic non-human mammals 
include, but are not limited to, the production of 
5 recombinant polypeptides in the milk of female 
transgenic non-human mammals, the production of animal 
models for the study of disease, the production of 
animals with higher resistance to disease (e.g. diseases 
of the mammary gland such as mastitis) and the 

10 production of recombinant polypeptides in the blood, 
urine or other suitable body fluid or tissue of the 
animal. In the preferred embodiments, transgenic bovine 
species are disclosed which are capable of producing 
recombinant human lactoferrin, human serum albumin and 

15 human Protein C in the milk of lactating females or 
human serum albumin in the liver of the transgenic 
animal. 

The transgenic non-human mammals of the invention are 
produced by introducing a "transgene" into an embryonal 

20 target cell of the animal of choice. In one aspect of 
the invention, a transgene is a DNA sequence which is 
capable of producing a desirable phenotype when 
contained in the genome of cells of a transgenic non- 
human mammal. In specific embodiments, the transgene 

25 comprises a "recombinant DNA sequence" encoding a 
"recombinant polypeptide". In such cases, the transgene 
is capable of being expressed to produce the recombinant 
polypeptide* 

As used herein, a "recombinant polypeptide" (or the 
30 recombinant DNA sequence encoding the same) is either 
a "heterologous polypeptide" or a "homologous 
polypeptide". Heterologous polypeptides are 

polypeptides which are not normally produced by the 
transgenic animal. Examples of heterologous 
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polypeptides include human milk proteins such as 
lactof err in , lysozyme , secreted immunoglobulins , 
lactalbumin, bile salt-stimulated lipase, etc., human 
serum proteins such as albumin, immunoglobulins. Factor 
5 VIII, Factor IX, protein C, etc. and industrial enzymes 
such as proteases, lipases, chitinases, and liginases 
from procaryotic and eucaryotic sources. The 
recombinant DNA sequences include genomic and cDNA 
sequences encoding the recombinant polypeptide. 

10 When recombinant DNA sequences encoding a heterologous 
polypeptide are used, the transgene may be integrated 
in a random manner into the genome of the species used 
for transgenesis. As disclosed in the Examples, 
transgenes encoding human lactof err in, human serum 

15 albumin and human Protein C in conjunction with a aSl- 
casein secretory signal sequence under control of aSl- 
casein expression regulation sequences are designed to 
produce and secrete these heterologous polypeptides from 
the mammary gland of a lactating transgenic mammal into 

20 its milk. 

As used herein, a homologous polypeptide is one which 
is endogenous to the particular transgenic species. 
Examples of endogenous polypeptides from bovine species 
include bovine milk proteins such as aSl, aS2, |S- and 

25 jc-casein, 0-lactoglobulin lactoferrin, lysozyme, 
cholesterol hydrolase, serum proteins such as serum 
albumin and proteinaceous hormones such as growth 
hormones. When recombinant DNA sequences encoding a 
homologous polypeptide are used, the transgene is 

30 preferably integrated in a random manner into the genome 
of the species used for transgenesis. Such random 
integration results in a transgenic animal which 
contains not only the transgene encoding the endogenous 
polypeptide but also the corresponding endogenous 

35 genomic DNA sequence. Accordingly, such transgenic non- 
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human mammals are readily characterized by an Increase 
in the copy number of genes encoding the endogenous 
polypeptide. Further, the transgene will generally be 
located at a position which is different from the 
5 endogenous gene. 

When DNA encoding a homologous polypeptide is expressed, 
for example, in bovine species, the transgenic animal 
is characterized by an increase in the amount of the 
homologous polypeptide in either the endogenous tissue 
10 or fluid in which it is normally found and/or by its 
presence in a tissue and/or body fluid which either does 
not normally contain the homologous polypeptide or 
produces it at significantly:: lower levels. 

Thus, for example, bovine cholesterol hydrolase is 

15 normally present in the colostrum for about the first 
15-20 days of lactation. This naturally occurring 
endogenous polypeptide increases calf weight. This 
protein, however, is also a homologous polypeptide when, 
for example, its expression in mammary secretory cells 

20 is placed under the control of expression regulation 
sequences, such as those obtained from bovine casein 
genes, which facilitate the expression of the homologous 
polypeptide beyond the lactation period that it is 
normally present. Thus, according to one aspect of the 

25 invention, bovine cholesterol hydrolase expression is 
maintained in transgenic bovine milk by placing the 
expression of cholesterol hydrolase recombinant DNA 
(either cDNA or genomic) under the control of bovine 
aSl-casein expression regulation sequences. When a 

30 genomic recombinant DNA is used, it is engineered such 
that it has appropriate restriction sites (e.g. Clal and 
Sail) at the 5 f and 3 R end of the structural gene such 
that it is capable of being inserted into an appropriate 
transgene genomic cassette ' A ~.g. p-16 Kb, CS which is 

35 described in Example 15) . Alcernatively, a recombinant 
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DNA encoding bovine cholesterol hydrolase derived from 
cDNA may be placed under control of bovine crSl-casein 
expression regulation sequence by substituting the human 
lactoferrin sequences in a plasmid such as pl6, 8HLF3 
S (containing a hybrid intervening sequence) or pl6, 8HLF4 
(containing a homologous aSl-casein intervening 
sequence) . When these particular plasmids are used, the 
cDNA clone is engineered such that it has appropriate 
Clal and Sail restriction sites at the ends of the 
10 recombinant DNA. 

By way of further example, bovine lactoferrin is 
normally present in only trace amounts in cow's milk. 
When, however, bovine lactoferrin is expressed under 
control of other regulatory sequences, for example, 

15 obtained from an aSl-casein gene, higher amounts of 
lactoferrin in the milk of transgenic bovine species are 
obtained. In another example, a transgene comprising 
DNA encoding homologous bovine growth hormone is 
incorporated into the bovine genome to confer superior 

20 growth characteristics to the transgenic animal. In 
other instances, homologous polypeptides include, for 
example, a polypeptide which normally is maintained 
intracellularly in a particular species but which is 
secreted into the milk or other extracellular 

25 compartment of the transgenic species, such as the 
circulatory system. 

Each of the heterologous or homologous polypeptides are 
characterized by specific amino acid and nucleic acid 
sequences. It is to be understood, however, that such 

30 sequences include naturally occurring allelic variations 
thereof and variants produced by recombinant methods 
wherein such nucleic acid and polypeptide sequences have 
been modified by the substitution, insertion and/or 
deletion of one or more nucleotides in such nucleic 

35 acids to cause the substitution, insertion or deletion 



WO 91/08216 



-15- 



PCT/US90/06874 



of one ore more amino acid residues in the recombinant 
polypeptide. 

When expression of the DNA of the transgene is necessary 
to generate a desired phenotype, e.g. to produce a 
5 recombinant polypeptide, the transgene typically 
includes at least a 5 1 and preferably additional 3 • 
"expression regulation sequences" each operably linked 
to a recombinant or secretory-recombinant DNA as defined 
hereinafter. Such expression regulation sequences in 
10 addition to controlling transcription also contribute 
to RNA stability and processing, at least to the extent 
they are also transcribed. 

* » 

Such expression regulation sequences are chosen to 
produce tissue-specific or cell type-specific expression 

15 of the recombinant or secretory-recombinant DNA. Once 
a tissue or cell type is chosen for expression, 5 1 and 
optional 3 1 expression regulation sequences are chosen. 
Generally, such expression regulation sequences are 
derived from genes that are expressed primarily in the 

20 tissue or cell type chosen. Preferably, the genes from 
which these expression regulation sequences are obtained 
are expressed substantially only in the tissue or cell 
type chosen, although secondary expression in other 
tissue and/or cell types is acceptable if expression of 

25 the recombinant DNA in the transgene in such tissue or 
cell type is not detrimental to the transgenic animal. 
Particularly preferred expression regulation sequences 
are those endogenous to the species of animal to be 
manipulated. However, expression regulation sequences 

30 from other species such as those from human genes may 
also be used. In some instances, the expression 
regulation sequences and the recombinant DNA sequences 
(either genomic or CDNA) are from the same species, 
&.g. , each from bovine species or from a human source. 

35 In such cases, the expression regulation sequence and 



WO 91/08216 



PCT/US90/06874 



-16- 

the recombinant DNA sequence are homologous to each 
other. Alteratively, the expression regulation 
sequences and recombinant DNA sequences (either cDNA or 
genomic) are obtained from different species, £-3-* an 
5 expression regulation sequence from bovine species and 
a recombinant DNA sequence from a human source) . In 
such cases, the expression regulation and recombinant 
DNA sequence are heterologous to each other. The 
following defines expression regulation sequences from 
10 endogenous genes. Such definitions are also applicable 
to expression regulation sequences from non-endogenous, 
heterologous genes. 

, In x general, the 5* expression regulation sequence 
includes the transcribed portion of the endogenous gene 

15 upstream from the translation initiation sequence (the 
5* untranslated region or 5 1 UTR) and those flanking 
sequences upstream therefrom which comprise a functional 
promoter. As used herein, a "functional promoter" 
includes those necessary untranscribed DNA sequences 

20 which direct the binding of UNA polymerase to the 
endogenous gene to promote transcription. Such 
sequences typically comprise a TATA sequence or box 
located generally about 25 to 30 nucleotides from the 
transcription initiation site. The TATA box is also 

25 sometimes referred to the proximal signal. In many 
instances, the promoter further comprises one or more 
distal signals located upstream from the proximal signal 
(TATA box) which are necessary to initiate 
transcription. Such promoter sequences are generally 

30 contained within the first 100 to 200 nucleotides 
located upstream from the transcription initiation site, 
but may extend up to 500 to 600 nucleotides from the 
transcription initiation site. Such sequences are 
either readily apparent to those skilled in the art or 

35 readily identifiable by standard methods. Such promoter 
sequences alone or in combination with the 5 1 



untranslated region are referred to herein as "proximal 
5 1 expression regulation sequences". 

In addition to such proximal 5" expression regulation 
sequences, it is preferred that additional 5 1 flanking 
5 sequences (referred to herein as "distal 5 1 expression 
regulation sequences*) also be included in the 
transgene. Such distal 5 • expression regulation 
sequences are believed to contain one or more enhancer 
and/or other sequences which facilitate expression of 

10 the endogenous gene and as a consequence facilitate the 
expression of the recombinant or secretory-recombinant 
DNA sequence operably linked to the distal and proximal 
5 • expression regulation sequences. The amount of 
distal 5» expression regulation sequence depends upon 

15 the endogenous gene from which the expression regulation 
sequences are derived. In general, however, such 
sequences comprise 5 • flanking regions of approximately 
Ikb, more preferably 16kb and most preferably about 30kb 
of 5 f flanking sequence. The determination of the 

20 optimal amount of distal 5» expression regulation 
sequence used from any particular endogenous gene is 
readily determined by varying the amount of distal 5 f 
expression regulation sequence to obtain maximal 
expression. In general, the distal 5' expression 

25 regulation sequence will not be so large as to extend 
into an adjacent gene and will not include DNA sequences 
which adversely effect the level of transgene 
expression. 

In addition, it is preferred that 3 f expression 
30 regulation sequences also be included to supplement 
tissue or cell-type specific expression. Such 3 1 
expression regulation sequences include 3 • proximal and 
3 • distal expression regulation sequences from an 
appropriate endogenous gene. The 3 1 proximal expression 
35 regulation sequences include transcribed but 
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untr ans lat ed DNA positioned downstream from the 
translation stop signal in the recombinant DNA sequence 
(also referred to as the 3* untranslated region or 3 9 
UTR) . Such sequences generally terminate at a 
5 polyadenylation sequence (either from the endogenous 
gene or from other sources such as SV40) and sequences 
that may affect RNA stability. Generally, 3 9 DTR f s 
comprise about 100 to 500 nucleotides downstream from 
the translation stop signal in the gene from which the 

10 3 1 regulation sequence is derived. Distal 3 1 expression 
regulation sequences include flanking DNA sequences 
downstream from the proximal 3' expression regulation 
sequence. Some of these distal sequences are 
transcribed, .but^do not form part of the mRNA while 

15 other sequences in this distal 3 9 expression regulation 
sequence are not transcribed at all. Such distal 3* 
expression regulation sequences are believed to contain 
enhancer and/ or other sequences which enhance 
expression. Such sequences are believed to be necessary 

20 for efficient polydenylation and contain transcription 
termination sequences Preferably, such sequences 
comprise about 2kb, more preferably 8kb and most 
preferably about l5kb of 3 9 flanking sequence. 

Although the use of both 5 1 and 3 • expression regulation 
25 sequences are preferred, in some embodiments of the 
invention, endogenous 3 9 regulation sequences are not 
used. In such cases, the 3' proximal expression 
regulation sequences normally associated with the 
genomic DNA encoded by the recombinant DNA sequence are 
30 used to direct polyadenylation. In addition, distal 3 9 
regulation sequences from the genomic DNA encoding the 
recombinant polypeptide may also be employed preferably 
in the same amounts as set forth for endogenous 3 9 
expression regulation sequences. In such cases, it is 
35 to be understood that the recombinant polypeptide 
encoded by the transgene may comprise either genomic DNA 
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or a double stranded DNA derived from cDNA. As with the 
5' expression regulation sequences, the optimal amount 
of 3' expression regulation sequence may be readily 
determined by varying the amount of 3 9 flanking sequence 
5 to obtain maximal expression of the recombinant 
polypeptide. In general, the distal 3 9 regulation 
sequence, be it from an endogenous gene or a 
heterologous gene, will not extend into the adjacent 
gene from which is derived and will exclude any 
10 sequences which adversely effect the level of transgene 
expression. 



Examples of expression regulation sequences core provided 
in Table I. 



TAPfcE 1 



15 



20 



25 



Expression Regulation 
Ssqusnce 

16Kb of bovine oSl 
casein 5 V to structural 
gene and 8kb 3* to 
structural gene 



«15kb 5 1 
gene 



to albumin 



«15kb 5 9 to a-actin 
gene 

«15kb upstream of 
protamine gene 



Tissue 

Specificity 

Mammary 

secretory 

cells 



Liver 



Muscle 



Spermatids 



Animal 
Species 

bovine 



murine 
murine 
murine 



In addition to the 5* and 3 9 expression regulation 
sequences and the recombinant DNA (either genomic or 
derived from cDNA) the transgenes of the invention 
30 preferably also comprise a "recombinant intervening 
sequence" which interrupts the transcribed but 
untranslated 5 9 region of the transgene. Such 
intervening sequences can be derived, for example, from 
bovine aSl-casein and from human lac t of err in. Such 
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sequences as used herein are "homologous recombinant 
intervening sequences 91 in that the 5 ' and 3 9 RNA splice 
signals in such recombinant intervening sequences are 
those normally found in an intervening sequence from an 
5 endogenous or heterologous gene. Recombinant 
intervening sequences may, however, also comprise a 
"hybrid intervening sequence" . Such hybrid intervening 
sequences comprise a 5 9 RNA splice signal and 3 1 RNA 
splice signal from intervening sequences from different 

10 sources. In some aspects of the invention, such hybrid 
intervening sequences comprise at least one "permissive 
RNA splice sequence". As used herein, a permissive RNA 
splice signal is an RNA splice signal sequence, 
preferably a 3 9 RNA splice signal, from an intron 

15 contained within a repertoire of germ line DNA segments 
which undergo rearrangement during cell differentiation. 
Examples of such gene repertoires include the 
immunoglobulin super gene family, including the 
immunoglobulins and T-cell antigen receptors as well as 

20 the repertoire of the major histocompatibility complex 
(MHC) genes and others. Particularly preferred 
permissive splice sequences are those obtained from the 
immunoglobulin repertoire, preferably of the IgG class, 
and more preferably those 3 9 splice signal sequences 

25 associated with the J-C segment rearrangement of the Jg 
heavy and light chain, most preferably the heavy chain. 
A particularly preferred permissive splice sequence 
comprises that portion of the sequence as shown 
downstream of the Hindlll site in Fig. 11. A 

30 particularly preferred hybrid intervening sequence 
comprises the entire sequence shown in Fig. 11 which 
includes a 5 9 portion of an intervening sequence from 
bovine aSl-casein and a 3 9 sequence portion of an IgG 
heavy chain intervening sequence. 

35 Such hybrid intervening sequences containing permissive 
RNA splice signals are preferably used when the 
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recombinant DNA corresponds to a cDNA sequence. As 
indicated in the Examples, when 16kb of 5' expression 
regulation sequence from the aSl-casein gene was used 
in conjunction with an oSl-casein-IgG hybrid intervening 
5 sequence to express human lactoferrin cDNA operably 
linked to the aSl-casein secretory signal sequence a 
transgenic mouse was obtained which produced 
approximately 1330 pg/ml of hLF in the transgenic milk. 
This amount of recombinant polypeptide far exceeds the 

10 previously reported amounts for production of various 
protein in transgenic mouse milk of generally less than 
10 Mg/ml and in one case approximately 50 /ig/ml. It also 
exceeds the maximum of 8/ig/ml of hLF produced herein 
when the same transgene was us&d that contained a 

15 homologous bovine intervening sequence rather than the 
hybrid intervening sequence. 

However, such hybrid intervening sequences are not 
limited to transgenes utilizing cDNA sequence. Rather, 
hybrid intervening sequences are also useful when the 

20 recombinant polypeptide is encoded by a genomic 
sequence. Based on the results obtained with the cDNA 
recombinant DNA and the general expectation that genomic 
DNA sequences express at higher levels than sequences 
derived from cDNA, it is expected that such hybrid 

25 intervening sequences used in conjunction with genomic 
recombinant DNA will further enhance expression levels 
above that which would otherwise be obtained with 
genomic sequence alone. 

Based on the foregoing, it is apparent that preferred 
30 transgenes include large amounts of 5 V and 3 V expression 
regulation sequences. Further, the recombinant DNA is 
preferably derived from genomic clones which may be tens 
to hundreds of kilobases in length. Based on the 
present technology for cloning and manipulating DNA, the 
35 construction and microinjection of transgenes is 
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practically limited to linearized DNA having a length 
not greater than about 50kb. However, the transgenes 
of the invention, especially those having a length 
greater than about 50kb, may be readily generated by 
5 introducing two or more overlapping fragments of the 
desired transgene into an embryonal target cell. When 
so introduced, the overlapping fragments undergo 
homologous recombination which results in integration 
of the fully reconstituted transgene in the genome of 

10 the target cell. In general, it is preferred that such 
overlapping transgene fragments have 100% homology in 
those regions which overlap. However, lower sequence 
homology may be tolerated provided efficient homologous 
recombination occurs. If non-homology does exist 

15 between the homologous sequence portions, it is 
preferred that the non-homology not be spread throughout 
the homologous sequence portion but rather be located 
in discrete areas. Although as few as 14 base pairs at 
100% homology are sufficient for homologous 

20 recombination in mammalian' cells (Rubnitz, J. and 
Subramani, S. (1984) Mol. Cell. Biol. 4. 2253-2258), 
longer homologous sequence portions are preferred, e.g. 
500bp, more preferably lOOObp, next most preferably 
2000bp and most preferably greater than 2000bp for each 

25 homologous sequence portion. 

As indicated in the examples, three overlapping 
fragments of the human serum albumin gene were 
micro injected into the pronuclei of mouse zygotes in 
approximately equal molar portions. These fragments 

30 successfully recombined and integrated into the mouse 
genome as confirmed by analysis of the integrated DNA 
by Southern blotting procedures and by detection of RNA 
transcript and human serum albumin in the serum of the 
transgenic mouse. Although the transgene so generated 

35 has a unit length of 3 8Kb, there is no known practical 
limit to the size of the transgene which may be formed 
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using larger and/ or greater numbers of overlapping 
transgene fragments. In particular, it is expected that 
transgenes may be formed by this approach having lengths 
between about 50 to lOOOkb and more preferably between 
5 50 and 500kb* Further, the use of homologous 
recombination of overlapping fragments is expected to 
be fruitful in the generation of larger transgenic 
animals, such as transgenic bovine species, containing 
transgenes incorporating recombinant DNA comprising 

10 genomic DNA which otherwise could not be incorporated 
into a pronucleus to form a transgenic animal* Such 
genomic transgenes are expected to produce higher 
expression levels in transgenic cows as compared to that 
Hthich is produced by transgenes encoding rc&6mbinant 

15 cDNA* 

When, the ultimate object is to secrete a recombinant 
polypeptide, a "secretory DNA sequence" encoding a 
functional secretion signal peptide is also operably 
linked within the transgene to direct secretion of the 

20 recombinant polypeptide from one or more cell types 
within the transgenic animal* Secretory DNA sequences 
in general are derived from genes encoding secreted 
proteins of the seme species of the transgenic animal. 
Such secretory DNA sequences are preferably derived from 

25 genes encoding polypeptides secreted from the cell type 
targeted for tissue-specific expression, e.g. secreted 
milk proteins for expression in and secretion from 
mammary secretory cells. Secretory DNA sequences, 
however, are not limited to such sequences. Secretory 

30 DNA sequences from proteins secreted from other cell 
types within the species of transgenic animal may also 
be used, e.g., the native signal sequence of a 
homologous gene encoding a protein secreted other than 
in the mammary glands. In addition, "heterologous 

35 secretory DNA sequences" which encode signal secretion 
peptides from species other than the transgenic animals 
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my also be used e.g., human t-PA, human serum albumin 
human lactoferrin and human lactalbumin and secretion 
signals from microbial genes encoding secreted 
polypeptides such as from yeast, filamentous fungi, and 
5 bacteria. In general, a secretory DNA sequence may be 
defined functionally as any DNA sequence which when 
operably linked to a recombinant DNA sequence encodes 
a signal peptide which is capable of causing the 
secretion of the recombinant polypeptide. 

10 In one of the preferred embodiments, a secretory DNA 
sequence encoding a secretory signal sequence functional 
in the mammary secretory cells of bovine species is used 
to payyse secretion of recombinant polypeptide^ from 
bovine mammary secretory cells. The secretory DNA 

15 sequence is operably linked to the recombinant DNA 
sequence. Examples of such secretory DNA sequences 
include DNA sequences encoding signal secretion 
sequences for bovine aSl-casein, murine lactoferrin and 
human transferrin. The preferred secretory DNA sequence 

20 is that encoding the secretory sequence of aSl-casein 
from bovine species. The use of this secretory DNA 
sequence is described in more detail in the Examples. 

"Operably linked" in the context of linking a secretory 
DNA sequence to a recombinant DNA sequence means that 

25 the secretory DNA sequence (comprising codons encoding 
the secretory signal peptide sequence) is covalently 
coupled to the recombinant DNA sequence so that the 
resultant secretory-recombinant DNA sequence encodes 5 * 
to 3 * for the secretory signal sequence and recombinant 

30 polypeptide. Accordingly, the reading frame for the 
secretory sequence and the recombinant DNA sequence must 
be covalently combined such that an open reading frame 
exists from the 5 ' end of the mRNA sequence formed after 
transcription and processing of the primary UNA 

35 transcript. This open reading frame in the RNA contains 
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a 5* sequence portion encoding the secretory signal 
peptide and a 3* sequence portion encoding the 
recombinant polypeptide. When so constructed, the 
recombinant polypeptide produced upon expression of the 
5 secretory-recombinant DNA sequence is of a form which 
is capable of being secreted from targeted cells which 
express the DNA sequence* The signal peptide generally 
is removed In Y jvp during secretion to produce an 
extracellular form of the recombinant polypeptide. 

10 In the preferred embodiments of the invention, a 
secretory-recombinant DNA sequence is expressed 
predominantly in the mammary secretory cells of 
transgenic bovine species. ic v^uch tissue-specific 
expression is obtained by operably linking mammary 

15 specific expression regulation DNA sequences to the 
above secretory-recombinant DNA sequence. Such mammary 
specific regulation sequences include the aforementioned 
regulation sequences contained in various bovine genes 
preferentially expressed in the mammary secretory cells 

20 of the species. Such mammary specific genes include 
aSl-casein; crS2 -casein; 0-casein; K-casein; 
oc-lactalbumin; and 0-lactoglobulin. Preferred 
expression regulation sequences are derived from aSl- 
casein as described more in detail in the Examples. 

25 In general, the transgenes of the invention that are 
designed to secrete the recombinant polypeptide into 
transgenic bovine milk are capable of causing such 
secretion at levels significantly higher than that 
previously reported for transgenic mice and sheep* When 

30 the recombinant polypeptide is encoded by a recombinant 
DNA corresponding to, or derived from, cDNA, the molar 
concentration of the recombinant polypeptide is 
preferably greater than about 1.0 /iK, more preferably 
greater than about 100 pM, and most preferably greater 

35 than 100 /iM. When viewed from the perspective of the 
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level of recombinant polypeptide present in the 
transgenic milk, the amount of recombinant polypeptide 
is preferably greater than 50 /ig/ml, more preferably 
greater than about 500 /xg /ml and most preferably greater 
5 than about 1000 pg/ml (Img/ml) . 

When the transgene of the invention encodes a 
recombinant polypeptide that is encoded by recombinant 
DNA derived from or corresponding to genomic DNA (or 
comprised substantially of such genomic sequences, e.g. 

10 greater than about 50%, more preferably greater than 
about 75%, most preferably greater than 90% of the 
codons encoding the recombinant polypeptide are from 
genomic sequences) , tjie molar concentrations and protein 
levels in bovine transgenic milk are the same as for 

15 cDNA or higher. In general, the molar concentration 
of the recombinant polypeptide in such transgenic milk 
is preferably greater than about 50 /iM f more preferably 
greater than about 150 pM, most preferably greater than 
about 500 [Mm When viewed from the level of protein in 

20 the transgenic milk, the levels are preferably greater 
than about 10 mg/ml, more preferably greater than about 
2.5 mg/ml, most preferably greater them 5 mg/ml. 

The foregoing molar concentration and protein levels in 
bovine transgenic milk will vary depending upon the 

25 molecular weight of the particular recombinant 
polypeptide. A particular advantage of producing a 
recombinant polypeptide in bovine transgenic milk is 
that relatively large molecular weight polypeptides may 
be so produced which are otherwise difficult to produce 

30 in large quantities in other systems such as prokaryotic 
expression systems. Although any recombinant 
polypeptide may be produced in bovine transgenic milk 
according to the invention, it is generally preferred 
that such recombinant polypeptides have a molecular 

35 weight greater than about 10,000 Daltons. However, 
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other recombinant polypeptides having molecular weights 
of greater than 15,000, greater than 20,000 and greater 
than 60,000 Daltons may also be expressed in transgenic 
bovine milk. For example, human lysozyme having a 
5 molecular weight of 17,000 Daltons and lactoferrin 
having a molecular weight of 79,000 Daltons may be 
readily produced in the transgenic milk of bovine 
species according to the disclosure of the invention. 
Thus, the recombinant polypeptides of the invention have 
10 a wide range of molecular weights. 

As a consequence, the foregoing preferred molar 
concentrations of recombinant polypeptides are adjusted 
when higher molecular weight recombinant polypeptides 
are produced. Such adjustment is made by converting the 
15 molar concentration to the amount of protein produced 
and adjusting the molar concentrations so that the 
recombinant protein level is within the following 
preferred concentrations. 

Host of the previous reports relating to the production 

20 of polypeptides in transgenic milk involve transgenic 
mice. The mouse, however, normally produces between 55 
to 80 milligrams of protein per ml of milk. A cow, on 
the other hand, normally produces between 30 to 34 
milligrams of protein per ml. Since exceptionally high 

25 levels of recombinant polypeptide production may 
adversely affect the production of endogenous milk 
protein and/ or have adverse effects upon the mammary 
secretory gland, it is preferred that the recombinant 
polypeptide concentration be between about 3 and 50% of 

30 the normal bovine milk protein concentration , 
between about 1 and 17 milligrams of recombinant 
polypeptide per ml of transgenic milk) , more preferably 
between 10 to 20% (A.fi. , between 3 to about 7 milligrams 
per ml) and most preferably between 10 and 15% (A.fi. , 

35 between about 3 and 5 milligrams per ml) of the normal 
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amount of protein produced in bovine milk* Such 
preferred ranges also provide a preferred maximum limit 
to the aforementioned levels of protein produced in 
transgenic bovine milk. 

5 The above described linking of various DNA sequences to 
form the transgene of the invention are performed by 
standard methods known to those skilled in the art or 
as described herein. Once the transgene or overlapping 
homologous fragments encoding the transgene are 
10 constructed as described they are used to make 
transgenic non-human animals. 

Methods of introducing transgenes or overlapping 
transgene fragments into embryonal target cells include 
microinjection of the transgene into the pronuclei of 

15 fertilized oocytes or nuclei of ES cells of the non- 
human animal. Such methods for murine species are well 
known to those skilled in the art. Alternatively, the 
transgene may be introduced into an animal by infection 
of zygotes with a retrovirus containing the transgene 

20 (Jaenisch, R. (1976) , Proc. Natl. Acad. Sci. USA. n, 
1260-1264). The preferred method is microinjection of 
the fertilized oocyte. In this preferred embodiment, 
the fertilized oocytes are first microinjected by 
standard techniques. They are thereafter cultured 4& 

25 vitro until a "pre- implantation embryo" is obtained. 
Such pre-implantation embryos preferably contain 
approximately 16 to 150 cells. The 16 to 32 cell stage 
of an embryo is commonly referred to as a morula. Those 
pre-implantation embryos containing more than 32 cells 

30 are commonly referred to as blastocysts. They are 
generally characterized as demonstrating the development 
of a blastocoel cavity typically at the 64 cell stage. 
Methods for culturing fertilized oocytes to the pre- 
implantation stage include those described by Gordon, 

35 et al. (1984), Methods in Enzvmolocrv. ±Q±, 414; Hogan, 
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et al. (1986) in Manipulating the Hn^Pff ETCfaTYfrr Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor , N.Y. 
(for the mouse embryo) ; and Hammer, et al. (1985) , 
Nature . 315 . 680 (for rabbit and porcine embryos) 
5 Gandolfi et al. (1987) J. Reprod. Fert . 41, 23-28; 
Rexroad et al. (1988) J. Anim. Sci . ££, 947-953 (for 
ovine embryos) and Eyes tone, W.H. et al. (1989) , J. 
Reprog, Fgrttf S5# 715-720; Camous., et al. (1984), 
Reprod. Fert. . 72 P 779-785; and Heyman, Y., et al. 

10 (1987) , Ther ioqenoloav . 27 . 5968 (for bovine embryos) . 
Such pre-implantation embryos are thereafter transferred 
to an appropriate female by standard methods to permit 
the birth of a transgenic or chimeric animal depending 
upon the stage of development! when the transgene is 

15 introduced. As is well known, mosaic animals can be 
bred to form true germline transgenic animals. 

Since the frequency of transgene incorporation is often 
low, the detection of transgene integration in the pre- 
implantation embryo is highly desirable. In one aspect 

20 of the invention methods are provided for identifying 
embryos wherein transgenesis has occurred and which 
permit implantation of transgenic embryos to form 
transgenic animals. In this method, one or more cells 
are removed from the pre-implantation embryo. When 

25 equal division is used, the embryo is preferably not 
cultivated past the morula stage (32 cells) • Division 
of the pre-implantation embryo (reviewed by Williams et 
al. (1986) Therioqenoloqy 22 . 521-531) results in two 
"hemi-embryos" (hemi-morula or hemi-blastocyst) one of 

30 which is capable of subsequent development after 
implantation into the appropriate female to develop jn 
u^ero to term. Although equal division of the pre- 
implantation embryo is preferred, it is to be understood 
that such an embryo may be unequally divided either 

35 intentionally or unintentionally into two hemi-embryos 
which are not necessarily of equal cell number. 
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Essentially, all that is required is that one of the 
embryos which is not analyzed as hereinafter described 
be of sufficient cell number to develop to full term in 
utero . In a specific embodiment, the hemi-embryo which 
5 is not analyzed as described herein, if shown to be 
transgenic, is used to generate a clonal population of 
transgenic non-human animals. 

One of each of the hemi-embryos formed by division of 
pre- implantation embryos is analyzed to determine if the 

10 transgene has been integrated into the genome of the 
organism. Each of the other hemi-embryos is maintained 
for subsequent implantation into a recipient female of 
the species. A preferred method for detecting 
transgenesis at this early stage in the embryo's 

15 development uses these hemi-embryos in connection with 
a unique property of the restriction endonuclease Dpn 
X- This enzyme recognizes the sequence GATC in double- 
stranded DNA but only when the adenine in each strand 
within this sequence is methylated at N-6. When using 

20 this preferred method, the transgene containing the 
sequence GATC is methylated prior to microinjection 
either by transferring the transgene on an appropriate 
plasmid through a DAM* strain of microorganisms such as 
JL. coli MM294 or by directly methylating the transgene 

25 with dam methylase. The methylated transgene 
(preferably without any exogenous sequences such as 
plasmid vector) is then microinjected into fertilized 
oocytes (approximately 10 to 500 copies per pronucleus, 
more preferably 50 to 100 copies per pronucleus) • The 

30 fertilized oocytes so obtained are cultured in vitro to 
the pre-implantation stage. During this early growth 
and cell division phase, the genomic DNA is replicated. 
Accordingly, those copies of the methylated transgene 
integrated into the genome of the fertilized oocyte are 

35 unmethylated after replication whereas any non- 
integrated transgenes which may still exist after 
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replication will remain methylated. (Lacks, S. # et al. 
(1977), J. Mol. Biol. . HA, 153.) This differential 
methylation pattern for integrated versus non-integrated 
transgene permits the identification of which fertilized 
5 oocytes have integrated the transgene into the genome. 

The identification of the pre-implantation embryos 
containing the integrated transgene is achieved by 
analyzing the DNA from each of the hemi-embryos . Such 
DNA is typically obtained by lysing the hemi-embryo and 

10 analyzing the thus released DMA after treatment as 
described by Ninomiy, T. et al. (1989) molecular 
Reproduction and D evelopment 1, 242-248. Each of the 
DNA samples is treated with Dpn I * Thereafter, a 
polymerase chain reaction (Saiki, '~et al. (1985), 

15 science . 230 . 1350-1354) is preformed to amplify all or 
part of the transgene. When the entire transgene is 
amplified, two extension primers each complimentary to 
opposite strands at opposing ends of the transgene are 
used for amplification. When, however, less than the 

20 entire transgene is amplified, such extension primers 
are chosen such that the amplified gene product spans 
the Dpn I site in the transgene. If fipnJL cleavage has 
not occurred, PGR amplification results in amplified 
sequences having a predetermined size whereas primer 

25 extension for those transgenes which have been cleaved 
will not result in exponential amplification. 
Generally, the Dpn I/PCR amplified DNA from the hemi- 
embryo is subjected to electrophoresis followed by 
hybridization with labeled probe complimentary to the 

30 region of the transgene between the two extension 
primers. This facilities the determination of the size 
of the amplified DNA sequences, if any, and provides an 
indication of whether the transgene has been integrated 
into the pre-implantation embryo from which the hemi- 

35 embryo was obtained (now called a "transgenic hemi- 
embryo") . If it has, the remaining untreated transgenic 
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hemi-embryo is transplanted into a recipient parent. 
Aft^r in utero development, the transgenic non-human 
animal having the desired phenotype conferred by the 
integrated transgene is identified by an appropriate 
5 method in utero or after birth. Of course, other 
restriction endonucleases capable of cleaving a 
methylated DNA sequence but incapable of cleaving the 
unmethylated form of a recognition sequence may be used 
in the aforementioned method. 

10 The above described method using Don I requires that the 
sequence GATC be present in the transgene of interest. 
In those cases when such a sequence is not present, it 
may be readily introduced into the transgene by site 
directed mutagenesis (Kunkel, T.A. (1985) , Proc. Natl. 

15 Acad. Sci» . 488) or cassette mutagenesis (Wells, 

J. A., et al. (1985), Gene r 315) provided such 

mutagenesis does not change the amino acid sequence 
encoded by the transgene (or causes an inconsequential 
change in amino acid sequence) and that any codons so 

20 generated are functional in the transgenic non-human 
animal of interest. 

The above described methods for the detection of 
transgenesis in pre-implantation embryos provide 
economical and time saving method for generating 

25 transgenic non-human animals since they significantly 
decrease the number of pregnancies required to produce 
a transgenic animal and substantially increase the 
likelihood that an implanted embryo will produce a 
transgenic non-human animal. Such methods are 

30 especially important for those animals for which very 
low or non-existent frequencies of transgenesis have 
been obtained, e.g. bovine species. 

In an alternate embodiment, the above described method 
for detecting transgenesis in pre-implantation embryos 



WO 91/08216 



-33- 



PCT/USW/06874 



is combined with embryonic cloning steps. to generate a 
clonal population of transgenic embryos which may 
thereafter be implanted into recipient females to 
produce a clonal population of transgenic non-human 
5 animals also having the same genotype. In this regard, 
it is to be understood that transgenic embryos and/or 
non-human transgenic animals having the same "genotype" 
means that the genomic DNA is substantially identical 
between the individuals of the embryo and/ or transgenic 

10 animal population. It is to be understood, however, 
that during mitosis various somatic mutations may occur 
which may produce variations in the genotype of one or 
more cells and/or animals. Thus, a population having 
the same genotype may demonstrate individual or 

15 subpopulation variations. * - 

After a hemi-embryo is identified as a transgenic hemi- 
embryo, it is cloned. Such embryo cloning may be 
performed by several different approaches. In one 
cloning method, the transgenic hemi-embryo is cultured 

20 in the same or in a similar media as used to culture 
individual oocytes to the pre- implantation stage. The 
"transgenic embryo" so formed (preferably a transgenic 
morula) is then divided into "transgenic hemi-embryos" 
which can then be implanted into a recipient female to 

25 form a clonal population of two transgenic non-human 
animals. Alternatively, the two transgenic hemi-embryos 
obtained may be again cultivated to the pre-implantation 
stage, divided, and recultivated to the transgenic 
embryo stage. This procedure is repeated until the 

30 desired number of clonal transgenic embryos having the 
same genotype are obtained. Such transgenic embryos may 
then be implanted into recipient females to produce a 
clonal population of transgenic non-hum^ n animals. 

In a preferred cloning method, the transgenic embryo is 
35 cloned by nuclear transfer according to the techniques 



r 
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of Prather et al. (1988) Biol, Reorod . 37; 59-86; Roble 
et al- (1987) J. Anim. Set . 6£, 642-664. According to 
this method, nuclei of the transgenic embryo are 
transplanted into enucleated oocytes, each of which is 
5 thereafter cultured to the blastocyst stage. At this 
point, the transgenic embryos may be resubjected to 
another round of cloning by nuclear transplantation or 
may be transferred to a recipient parent for production 
of transgenic offspring having the same genotype* 

10 In addition to the foregoing methods for detecting early 
transgenesis, other methods may be used to detect 
transgenesis. Such methods include in utero and post 
partum analysis of tissue* In utero analysis is 
performed by several techniques. In one, transvaginal 

15 puncture of the amniotic cavity is performed under 
echoscopic guidance (Bovgso et al. (1975) Bet. Res . 96 . 
124-127; Ramsey et al. (1974) J. Anim. Sci . ?9 f 
386-391) . This involves recovering about 15 to 20 
milliliters of amniotic fluid between about day 35 and 

20 day 100 of gestation. This volume of amniotic fluid 
contains about 1000 to 12,000 cells per ml originating 
from the urogenital tract, the skin and possibly the 
lungs of the developing embryo. Host of these cells are 
dead. Such cells, however, contain genomic DNA which 

25 is subjected to PCR analysis for the transgene as an 
indication of a successful transgenesis. Alternatively, 
fetal cells may be recovered by chorion puncture. This 
method also may be performed transvaginally and under 
echoscopic guidance. In this method, a needle is used 

30 to puncture the recipient animal's placenta, 
particularly the placentonal structures, which are fixed 
against the vaginal wall. Such sampling may be 
performed around day 60 of gestation in bovine species. 
Chorion cells, if necessary, are separated from maternal 

35 tissue and subjected to PCR analysis for the transgene 
as an indication of successful transgenesis. 
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Transgenesis may also be detected after birth* In such 
cases, transgene integration can be detected by taking 
an appropriate tissue biopsy such as from the ear or 
tail of the putative transgenic animal* About one to 
5 two centimeters of tail or about five to ten square 
millimeters of ear are obtained followed by southern 
blotting with a probe for the transgene according to the 
method of Hogan et ai. (1986) Manipulating ths ttwge 
ErofrrV9r Cold Spring Harbor Laboratory. 

ID In those embodiments where a recombinant polypeptide is 
expressed and secreted into the milk of transgenic 
bovine species , the transgenic milk so obtained may be 
either used as is or further treated to purify the 
recombinant polypeptide. This depends, in part, on the 

15 recombinant polypeptide contained in the transgenic milk 
and the ultimate use for that protein. Thus, when the 
recombinant polypeptide is secreted into transgenic milk 
to increase the nutritional value of the bovine milk, 
no further purification is generally necessary. An 

20 example of such a situation involves one of the 
preferred embodiments wherein human lactoferrin is 
produced in the milk of bovine species as a supplement 
to control intestinal tract infections in newborn human 
infants and to improve iron absorption. In other 

25 situations, a partial purification may be desired to 
isolate a particular recombinant polypeptide for its 
nutritional value. Thus, for example, human lactoferrin 
produced in transgenic bovine milk may be partially 
purified by acidifying the milk to about pH 4-5 to 

30 precipitate caseins. The soluble fraction (the whey) 
contains the human lactoferrin which is partially 
purified. 



The recombinant polypeptide contained in bovine 
transgenic milk may also be used in food formulations. 
35 A particularly useful food formulation comprises an 
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Infant f ormula containing one or more recombinant 
polypeptides from transgenic bovine milk which have 
either nutritional or other beneficial value. For 
example, an infant formula containing human lactof errin 
5 from transgenic bovine milk made according to the 
present invention provides a bacteriostatic effect which 
aids in controlling diarrhea in newborn. Similarly, 
recombinant polypeptides such as human casein and human 
lysozyme may also be generated in transgenic bovine milk 

10 to provide nutritional value. Table 2 sets forth the 
constituents of a typical infant formula. As indicated 
therein, the protein content varies between about 1.8 
and 4.5 grains of protein per 100 kilocalories of 
formula. Thus, the total protein including recombinant 

15 polypeptide should lie between the values at least based 
on regulatory requirements in the United States from 
which the formulation in Table 2 is based. The amount 
of total protein including recombinant polypeptide, of 
course, may vary from the -foregoing depending upon the 

20 local regulations where the particular formula is 
intended to be used. 
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TABLE 2 



Nutrient 



Protein (gn) 1 

Fat: 
m 

percent eel 

Essential fatty acids (l(noleats): 

percent cal 



HlnW 



Maximum* 



3.3 
30.0 



2.7 
300.0 



4.5 



6.0 
54.0 



Vitamins: 
(A) <IU) 
D CIU) 
K <M> 
E €1U> 

C (ascorbic acid (me) 
B, (thiamine (*g) 
Bj (riboflavin) (*g) 
B 4 (pyridoxins) (*g> 

B, 2 (*g) 
Niacin (sg) 
Folic acid Cog) 
Pantothenic acid ( g) 
Biotin (so) 
Choline (mg) 
Inositol fag) 

Minerals: 
Calciua (mg) 
Phosphorus (mg) 
Nagnesium (mg) 
Iron (mg) 
Iodine (*g> 
Zinc (mg) 
Copper (ag) 
Manganese (ag) 
Sodium (mg) 
Potassium (mg) 
Chloride (mg) 



250,0 (75 ag)" 
40.0 
4.0 

0.7 (with 0.7 lU/gm 
lineolelc acid) 

0.0 
40.0 
60.0 

35.0 (with 15 M/gm of 

protein in formula) 

0.15 
250.0 

4.0 
300.0 

1.5* 

7.0* 

4.0* 



50.0" 
25.0" 
6.0 
0.15 
5.0 
0.5 
60.0 
5.0 
20.0 
80.0 
55.0 



750.0 (225 ag)' 
100.0 



60.0 
200.0 
150.0 



"Stated per 100 M localorles. 

'The source of protein shall be at least nutritionally equivalent to casein. 
"Retlnol equivalents. 

"Required to be Included In this amount only in formulas which are not milk-based. 
"Calcium to phosphorus ratio must be no less than 1.1 nor more than 2*0. 
'includes recombinant protein according to the invention or recombinant proteins and 
other proteins. 



WO 91/08216 PCT/US90/06874 

-38- 

In addition to infant formulas, other food formulations 
may also be supplemented with recombinant polypeptides 
from transgenic bovine milk* For example, such 
recombinant polypeptides may be used to supplement 
5 common diet formulations. 



When the recombinant polypeptide is intended to be used 
pharmaceutically, purification methods consistent with 
such an application are called for. Such purification 
methods will depend on the particular recombinant 

10 polypeptide to be purified and are generally known to 
those skilled in the art. Such methods typically 
include a partial purification by casein fractionation 
followed by chromotography of the appropriate fraction 
containing the recombinant polypeptide * Such 

15 chromotography includes affinity chromotography, ion 
exchange chromotography, gel filtration and HPLC. 

In a specific embodiment of the invention, transgenes 
are provided for producing human lactof err in in the milk 
of transgenic bovine species. Human lactof err in (HLF) 

20 is a single chain glycoprotein which binds two ferric 
ions. Secreted by exocrine glands (Mason et al. (1978) 
J, Clin. Path . 21, 316-327; Tenovuo et al. (1986) 
Infect. Immun . 51 . 49-53) and polymorphonuclear 
neutrophil granulocytes (Mason et al. (1969) J. Exp. 

25 Med . 130 . 643-658) , this protein functions as part of 
a host non-specific defense system by inhibiting the 
growth of a diverse spectrum of bacteria. HLF exhibits 
a bacteriostatic effect by chelation of the available 
iron in the media, making this essential metal 

30 inaccessible to the invading microorganisms (Bullen et 
al. (1972) Br. Med. J . l, 69-75; Griffiths et al. (1977) 
Infect- Immun - 1£, 396-401; Spik et al. (1978) 
Immunology &, 663-671; Stuart et al. (1984) Int. J. 
Biochem . 1£, 1043-1947) . This effect is blocked if the 

35 protein is saturated with ferric ions. Several studies 
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suggest that HLF displays a direct bacteriocidal effect 
on certain microorganisms (Arnold et al. (1980) Jnffect* 
Immun . 2&, 893-898; Arnold et al. (1977) ggjfflC? 122/ 
263-265; Arnold et al. (1981) infect. Immun. 2Z, 

5 655-660; Arnold et al. (1982) Infest* EffiUn- 15# 

792-797; Bortner et al. (1986) Infect. Immun. 51, 
373-377). The bacteriocidal effect is also inhibited 
by iron saturation of the protein. No mechanism for the 
bactericidal effect of HLF has been postulated, although 
10 it has been demonstrated that it can damage the outer 
membrane and alter outer membrane permeability in gram- 
negative bacteria (Ellison et al. (1988) Infect i 
5£, 2774-2781). 

Lactoferrin is the major iron binding <proieim in human 

15 milk (present at a concentration of about 1.5-1.7 mg/ml) 
and may play a role in the absorption of iron by the 
small intestine. All of the iron present in breast milk 
is thought to be bound to hLF and is taken up at very 
high efficiencies compared to formula (Hide, D.W. , et 

20 al. (1981), Airch. Pis. Child.. 5£, 172). It has been 
postulated that the high uptake of the hLF bound iron 
is due to a receptor in the jejunum and data has been 
presented suggesting existence of receptors in Rhesus 
monkeys (Cox, et al. (1979), BEA, Sfift, 120; Davidson, 

25 L.A., et al. (1985), Fed. Proc. , 901). There is 

also evidence for specific lactoferrin receptors on 
mucosal cells of the small intestine of human adults 
(Cox, et al. (1979) Biochem. Biophvs. Acta. 5S&# 120- 
128) . Free iron levels have been implicated in the 

30 control of the intestinal flora (Mevissen-Verhage, et 
al. (1985), Eur. J. Cl in. Microbiol., £, 14). Breast 
fed infants, compared with infants fed cow»s milk, with 
and without added iron, were shown to have substantially 
reduced coliform and, elevated bifidobacteria and 

35 Clostridia counts in fecal samples. In in vitro 
studies, human milk has been shown to have a specific 
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inhibitory effect on IL_ coli (Brock, et al. (1983), 
Infect, ^nfl iTfff nunit. . 40, 453). Human milk has also 
been shown to have a specific inhibitory effect on E. 
Sali. in small intestine due to its high content of iron 
5 binding protein, predominantly hLF (Bullen, et al. 
(1972), British m$r Jt , lr 69). 

Thus, the production of human lactoferrin in the milk 
of transgenic bovine species provides a source of human 
lactoferrin. Such lactoferrin may be purified from the 

10 transgenic milk for formulation purposes. 
Alternatively, the whole transgenic milk may be used, 
preferably after pasteurization, in either liquid or 
dried form. In addition, the beneficial action of human 
lactoferrin may be potentiated by combining the human 

15 lactoferrin or the transgenic milk containing it with 
human lysozyme. The human lysozyme may be 
simultaneously produced in the transgenic cow by 
introducing a second transgene simultaneously with the 
HLF transgene to produce a transgenic cow capable of 

20 producing more than one recombinant polypeptide in the 
transgenic milk. Alternatively, the transgenes may be 
sequentially introduced into bovine species. When such 
is the case, a transgenic bovine species is obtained 
containing one of the transgenes. Thereafter, embryonic 

25 cells, such as eggs, are obtained from the transgenic 
female and treated so as to incorporate the second 
transgene encoding the second polypeptide. Preferably, 
the egg is fertilized, followed by microinjection of the 
pronucleus of the zygote so obtained. It is to be 

30 understood that the foregoing combination of more than 
two recombinant polypeptides in transgenic bovine milk 
is not limited to the aforementioned human lactoferrin 
and lysozyme combination. Thus, the invention 
contemplates the production of transgenic bovine species 

35 and transgenic milk wherein more than one recombinant 
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polypeptide is produced by such a transgenic animal in 
the transgenic milk. 

The complete amino acid sequence of HLF has been 
determined (Metz-Boutigue et al. (1984) Eur. J. Biochem. 
5 1451 . 659-676). HLF comprises two domains, each 
containing one iron-binding site and one N-linked 
glycosylation site. These domains show homology between 
each other, indicative of an ancestral gene duplication 
and fusion event. In addition, HLF shares extensive 

10 homology with other members of the transferrin family 
(Metz-Boutigue, supra : Pentecost et al. (1987) ia-iiifil*. 
Chem . 262 . 10134-10139) . Location of the amino acids 
involved in the iron-binding sites has been determined 
by X-ray crystallography "(Anderson et al. (1987) ?rQ<? t , 

15 Natl. Acad. Sci . M, 1769-1773). A partial cDNA 
sequence for neutrophil HLF was published by Rado et al. 
(1987) Blood 70 , 989-993. There was a >98% agreement 
between the amino acid sequence deduced from the cDNA 
and that which was determined by direct analysis of 

20 lactoferrin from human milk. The structure of the iron- 
saturated and iron-free form of human lactoferrin have 
recently been published. (Anderson, fit (1989) J*. 

Mol. Biol . 209 . 711-734; Anderson, efcal. (1990) fiatoare, 
784-787.) 

25 As used herein, "human lactoferrin" comprises a 
polypeptide having the amino acid sequence substantially 
as described by Metz-Boutigue, et al. (1984), Burt Ji 
Biochem. . 1451 . 659-676 and as set forth in Fig. 2. It 
is noted, however, that an earlier partial sequence of 

30 the human lactoferrin sequence disclosed a number of 
discrepancies between the published sequence and that 
obtained herein. Specifically, the following 

discrepancies exist (amino acid numbering is from the 
sequence in Figure 1 with DNA position in parenthesis) : 
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Aminp A<?i4 




In Metz-Bouticrue 


Arg 


122 (418) 


Absent 


Thr 


130 (442) 


He 


Gin 


151 (505) 


Arg 


Ser 


184 (604) 


Leu 


Tyr 


189 (619) 


Lys 


Ser 


372 (1169) 


TrP 


itween Ala 






and Met 


391 (1122) 


13 amino acids 


Cys 


403 (1225) 


Gly 


Gin 


512 (1588) 


Glu 


Lys 


675 (2077) 


Arg 



Accordingly, human lactoferrin is also defined by the 
sequence shown in Figure 1 which combines the sequence 

15 differences obtained herein with the published sequence. 
The term human lactoferrin also includes allelic 
variations of either of these sequences or recombinant 
human lactoferrin variants wherein one or more amino 
acids have been modified by the substitution, insertion 

20 or deletion of one or more amino acid residues. In some 
instances human lactoferrin may be produced in milk with 
all or part of a secretory signal sequence covalently 
attached thereto. 

As used herein, a "human lactoferrin DNA sequence 9 * is 
25 a DNA sequence which encodes human lactoferrin as 
defined above. Such a human lactoferrin DNA sequence 
may be obtained from a human mammary gland cDNA library 
or may be derived from the human genome. Example 2 
herein describes the cloning and nucleotide sequence of 
30 human lactoferrin derived from a human mammary gland 
cDNA library. The DNA sequence of this human 
lactoferrin is shown in Fig. 1 and Fig. 2 and is 
substantially the same as that described by Rado, et al. 
(1987), Blood . ZSl, 989-993. The construction of 
35 plasmids containing an expressible transgene encoding 
hLF is described in the examples. One of these plasmids 
is cGPlHLF also sometimes referred to as 16,8HLF3) 
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contains a transgene designed for tissue-specific 
expression in bovine mammary secretory cells. 

In a second embodiment of the invention, transgenes are 
provided for producing human serum albumin in the milk 
5 of transgenic bovine species. Human serum albumin is 
a serum protein which contains 584 amino acid residues 
(Minghetti, et al. (1986) , J. Biol. Chem. . 2£1, 6747) . 
It is the most abundant protein in human serum and 
performs two very important physiological functions. 
10 Serum albumin is responsible for about 80% of the total 
osmolarity of blood and it transports fatty acids 
between adipose tissues* 



Human serum albumin id' used 'primarily to expand plasma 
volume by restoring osmotic pressure in the circulatory 

15 system. Currently, a heat treated serum derived hSA 
fraction is infused in most shock and trauma victims, 
including most of the patients undergoing extensive 
surgery. HSA is presently derived from human blood 
plasma as a by-product from blood fractionation 

20 processes to obtain rare blood proteins such as factor 
VIII and IX. The recently developed technology of 
producing such factors by biotechnological means , 
however, threatens the source of human serum albumin. 



As used herein "human serum albumin 9 * comprises a 
25 polypeptide having the amino acid sequence substantially 
as that described by Minghetti, et al.. Ibid; Lawn, et 
al. (1981), Kucl. Acids Res. . S, 6103. Also included 
are variations thereof including recombinant human serum 
albumin variants wherein one or more amino acids have 
30 been modified by the substitution, insertion or deletion 
of one or more amino acid residues. (Minghetti et al. 
(1986) J. Biol. Chem . ZSli 6747-6757.) In some 
instances, human serum albumin may be produced in milk 
by expressing a transgene which contains DNA encoding 
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the secretory signal sequence of hSA. Alternatively, 
human serum albumin may be produced in and secreted from 
liver cells of a transgenic animal utilizing a 
completely heterologous transgene comprising human 
5 genomic DNA encoding 5 9 expression regulation sequences, 
the human serum albumin secretion signal and structural 
gene and 3 V expression regulation sequences. As 
indicated in the Examples, transgenes containing this 
heterologous sequence were formed by in vivo homologous 
10 recombination of overlapping transgene fragments to 
reconstitute the hSA gene in the transgenic animal. The 
so formed transgenic animal produced human serum albumin 
in its circulatory system. 

As used herein, a "human serum albumin DNA sequence 1 * is 
a DNA sequence which encodes human serum albumin as 
defined above. Such a human serum albumin DNA sequence 
may be obtained from XHAL-HAI, XHAL-3W and XHAL-HI4 as 
described by Urano et al. (1986) J. Biol. Chem, 261 . 
3244-3251 and Urano et al. (1984) Gene 32 . 255-261 and 
in the Examples herein. 

The human serum albumin DNA sequence was cloned as 
described in Example 10 herein and subsequently 
manipulated to substitute for the human lactof errin gene 
encoded in plasmid cGPlHLF (also referred to as 
25 pl6,8HLF4). From this plasmid a transgene is obtained 
containing 16kb of the 5 1 expression regulation sequence 
of the bovine aSl-casein gene, human serum albumin DNA 
sequence and approximately 8Jcb of the 3 1 -flanking region 
of the aSl-casein bovine gene. This transgene is used 
30 to micro inject fertilized oocytes from bovine species. 
After early detection of transgenesis, blastocysts 
containing the hSA transgene are implanted into a 
recipient female bovine species and brought to term. 
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The following is presented by way of example and is not 
to be construed as any limitation on the scope of 
the invention. 

EXAMPLE 1 

5 Construction of a probe specific for 
bovine ftSl-case in sequences, 

A. Tsolation of Chromosomal DNA 

Placental tissue was obtained from the slaughterhouse. 

10 Surrounding connective tissue was removed and pieces of 
about 30 grams were quickly frozen in liquid Nj. 
Chromosomal DNA was isolated as follows: 30 grams of 
tissue was homogenized (on ice) with 35 ml of Buffer 1 
containing 300 mM Sucrose; 60 mM KC1; 15 mM NaCl; 60 

15 Tris.HQl pH 8.2; 0.5 mM spermidine; 0.15 mM, spefl&ine; 
2 mM EDTA; 0.5 mM EGTA. 65 ml of icecold buffer 1 
containing 1% NP40 was added and the mixture was 
incubated for five minutes on ice. After centrifugation 
for five minutes at 3000 xg the pellet was rinsed with 

20 buffer 1 containing 1% NP40. After repeating the 
centrifugation step the pellet was resuspended in 5 ml 
of buffer 1. 5 ml 0.5 M EDTA was quickly added. Final 
volume was now 15 ml. 0.15 ml of a 10% SDS solution was 
added. After mixing, RNAse A and Tl were added to final 

25 concentrations of 0.4 mg/ml and 6 u/ml respectively. 
After incubation at 37°C for three hours, Proteinase K 
was added to a final concentration of 0.1 mg/ml. This 
mixture was incubated for 15 hours at 37°C. The mixture 
was then carefully extracted with phenol. The aqueous 

30 phase was isolated and 1/30 volume of 3M NaOAc pH 5.2 
and one volume of isopropylalcohol was added. The 
precipitate (DNA) was rinsed with 70% ethanol and slowly 
dissolved in 0.5 ml of 10 mM Tris.HCl pH 8.0; 1 mM EDTA, 
at 4°C. 
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B. Amplification of Sequences from the 

5' -flanking Region of the aSl-casein Gene 

Two DNA-primers were synthesized based on the sequence 
published by Yu-Lee et al., (1986) Nucl. Acids Res . 14 . 
5 1883-1902. Primer 1 was located at position-681 
relative to the major transcription initiation site and 
had the following sequence: 

5 • -TCC ATG GGG GTC ACA AAG AAC TGG AC-3 1 . 
(Seq. ID No.: 5) 

10 Primer #2 was located at position +164 relative to the 
major transcription initiation site and had the 
following sequence: 5 1 -TGA AGC TTG CTA ACA GTA TAT CAT 
AGG-3 1 (Seq. ID. No.: 6). The first eight nucleotides 
of this primer are not encoded by the bovine genome, but 

15 contain a Hindlll restriction site to facilitate 
subsequent cloning steps. These primers were annealed 
to the chromosomal DNA and extended in the presence of 
deoxynucleotides by TAQ-polymerase. After three minutes 
the mixture was denatured for one minute at 92 °C, 

20 reannealed at 50 °C for 1.5 minutes and again incubated 
at extension temperature (68°C) for 2 minutes. This 
cycle was repeated 30 times. After the last cycle DNA 
was checked for the presence of the expected EcoRI 
sites. Both the size of the fragment and the presence 

25 of EcoRI sites was as expected. The fragment was then 
treated with Klenow enzyme to repair any overhanging 
ends, treated with kinase to attach phosphate groups at 
the ends of the fragment, incubated at 65 9 C for 10 
minutes to inactivate the kinase and klenow enzymes and 

30 finally digested with Hindlll. This fragment was then 
subcloned in pUC19 (Yanisch-Perron, et al. (1985) , Gene , 
3?, 103-109) digested with Smal and Hindlll. Formal 
proof of the identity of this fragment was obtained by 
sequencing parts of this subclone (after re-cloning into 

35 H13 vector) • The determined sequence was identical to 
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the published sequence. This probe was then used to 
screen a bovine genomic library to obtain clones 
specific for the 5' -flanking region of the aSl-casein 
gene. 

5 c. Amplification of Sequences from the 

3 '-flanki ng ^gio" ° f the <*Sl-casein Gene 

A similar approach was taken as described above. Two 
primers were designed based on the sequence published 
by Stewart et al (1984) 1»V«1 - Acids Res. J£, 3895-3907. 
10 The 5 » -primer was located just downstream of the coding 
sequence starting at position 713 of the cDNA sequence. 
It had the following sequence: 

5 '-GAG GGA CTC CAC AGT TAT GG-3 » (Seq. ID No.: 7). 



The other primer was located at position 1070 of the 
15 cDNA sequence and had the following sequence: 5»-GCA 
CAC AAT TAT TTG ATA TG-3'(Seq. ID No.: 8). These 
primers were annealed to the chromosomal DNA and the 
region between these primers was amplified as described 
above. The resulting fragment was »900 bp longer then 
20 expected. Sequence analysis showed that an intervening 
sequence of this size was present between nucleotide 737 
and 738 of the cDNA. The amplified fragment was treated 
with Klenow-polymerase to repair any overhanging ends 
and treated with kinase to attach phosphate groups to 
25 the ends of the fragment. The fragment was then ligated 
into pUCl9 previously cut with Smal. 
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D. Screening of a Bovine Phage Library 
for aSl-casein Flanking Seq uences 

A bovine genomic library, constructed in EMBL3, was 
obtained from Dr. M. Groenen, Agricultural University 
5 Wageningen, Netherlands, and was screened in the 
following way. The bacteriophage particle titre was 
determined on Escherichia coli MB406 a permissive host 
strain (Stratagene Inc.). For this, several dilutions 
of the phage stock were made in SM buffer (50 mM 

10 Tris.HCl pH 7.5, 100 mM NaCl, 10 mM MgS04, 0.01% 
gelatin) and mixed with 200 /il MB406 (O.D.^ « 0.9); 
after 20 minutes at 37 °C, 3 ml top agarose (Luria- 
Bertani medium, 0.8% agarose, 10 mM MgCl 2 ) was added and 
this was plated on LB plates and incubated overnight at 

15 37°C. 

Approximately 600,000 phages were then plated by adding 
the required amount of phage stock to 400 /il MB406. The 
subsequent plating was as described as above. The next 
step was transfer of the phage to nitrocellulose 

20 filters. Plates were placed at 4*C for one hour. 
Nitrocellulose filters (S&S) were placed on the top 
agarose layer and exact position was marked. After 
lifting, the filters were soaked for (1) 30 minutes in 
denaturation buffer (1.5M NaCl, 0.5M NaOH) ; (2) 5 

25 minutes in neutralizing buffer ( l . 5M NaCl , 0 . 5M Tris . HC1 
pH 8.0) . After rinsing with 2xSSPE (360 mM NaCl, 20 mM 
Naiy?0 4 , 2 mM EDTA) , the filters were baked under vacuum 
at 80 °C for two hours. 

Prehybridization of the filters was performed in a 
30 buffer containing 50% formamide, 5x Denhardt's solution 
(0.1% Ficoll, 0.1% polyvinylpydrolidone, 0.1% bovine 
serum albumin) , SxSSPE, 0,1% SDS and 100 fig /ml denatured 
salmon sperm DNA at 42 °C for two hours. Hybridization 
was performed in same buffer at 42 °C overnight in a 
35 shaking waterbath. The probe, generated as previously 
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described, was labelled using the Random Primed 
labelling kit from Boehringer Mannheim. After overnight 
hybridization the filters were washed three times with 
2xSSC, 0.1% SDS at room temperature* 

5 Overnight exposure of Kodak XAR films was performed with 
amplifying screens (Dupont) at -70 °C. Putative 
positives were plugged out of the plates and put 
overnight in SM buffer at 4°C. These were plated out 
as described above and DNA was isolated following the 

10 plate lysate method (Maniatis, T., et al. (1982), 
Molecular Cloning z A Laboratory Manual. Cold Spring 
Harbor, N.Y. ) . 5 ml SM buffer was added to the top 
agarose layer; after two hours gentle shaking buffer was 
removed and spun ~at v 4000 rpm at 4°C for 10 minutes. 

15 Supernatant was transf erred to sterile tubes and RNase 
A and DNasel (both final concentration 1/xg/ml) was 
added, this was incubated at 37 °C for 30 minutes. One 
volume of a 20% polyethyleneglycol, 2.5 M NaCl solution 
was added and put on ice for one hour. Centrifugation 

20 at 4000 rpm for 30 minutes at 4°C left precipitated 
bacteriophage particles. These were resuspended in 
500 ml SM buffer, SDS (final concentration 0.1%) and 
EDTA (final concentration 5 mM) was added, this was 
incubated at 68 °C for 15 minutes. Protein was removed 

25 with one phenol and one chloroform extraction step. 
Precipitation of phage DNA was performed with one volume 
isopropanol. Phage DNA was washed once with 70% ethanol 
and dissolved in 50 ml Tris.HCl pH 7.5, 1 mM EDTA 
buffer. 

30 Restriction enzyme analysis, agarose gel 
electrophoresis, transfer of DNA from gel to 
nitrocellulose filter and Southern blotting were all 
done according to standard procedures (Maniatis (1982) , 
Mr >m^T- cloning: A Laboratory Manual) . Hybridization 

35 with probes (described hereinafter) was performed 
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according to the same procedure as the screening 
conditions described above, 

E. Isolation of Clones Containing 

5' -flanking Reoion of Bovine si-casein 

5 Three putative clones were identified using the probe 
and procedures as described above. After another round 
of screening, clean recombinant bacteriophage was 
analyzed. Digestion of cloned DMA with Sail, EcoRI and 
Sfrll/fiCfrBI (double digestion) and hybridization with the 

10 probe described above showed identical inserts in all 
three clones. The insert consisted of an 18kb (partial 
Saua& fragment excised with Sail ) . Transciptional 
orientation in the clone was determined with 
hybridization of above described restriction fragments 

15 with (1) probe 1 described above, and (2) the Ncol-Nsil 
fragment of probe 1. This showed a region of about 16kb 
upstream of transcription start. Downstream from the 
transcription start was another 1.9kbp. Sequencing of 
part of the latter region showed the presence of exon 

20 2 and part of intron 2 of the bovine aSl-casein gene. 
Additional sequencing of the region-103 - +300 confirmed 
the identity of the clone. The ethidium-bromide pattern 
of the described restriction fragments also showed the 
orientation of the clone in the EMBL vector. Subsequent 

25 analysis of the clone with the following restriction 
enzymes (ficsl, £s£I, jCpnj, lagHI, Hindlll . Balm 
resulted in the restriction map of 5 1 flanking region 
of bovine Sl-casein gene as shown in Fig. 3. 

F. Isolation of Clones Containing 

30 3' -flanking Region of Bovine aSl-casein 

Duplicate nitrocellulose filters from the initial phage 
plating used for isolating 5 • clones were screened with 
the 3' aSl-casein probe using the same hybridization 
conditions previously described. Eight positive clones 
35 were identified after two rounds of screening. Phage 
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DNA was prepared as described. Subsequent restriction 
digests with Sail . EcoRI , and Sal/EcoRy and southern 
hybridization with the 3 V aSl probe showed identical 
inserts in seven of the eight clones. One clone 
5 containing an 18.5kb EcoRI insert was further analyzed 
with the restriction enzymes Bstell and BamHI . A 
restriction map of that clone is shown in Fig. 4. 

EXAMPLE 2 
Cloning pf Bvroan LactQf errin Sens 
10 a. Material*? 

Restriction endonucleases, T4 ligase, and T7 
polynucleotide kinase were obtained from Boehringer- 
Mannheim, New England Biolabs, or Bethesda Research 
Laboratories. Radio-isotopes were purchased from 
15 Amersham. A human mammary gland cDNA library in 
bacteriophage Xgtll was obtained from Clontech, Inc. , 
Palo Alto, Calif. 

B. Isolation of the Human Lactoferrin Gene 

The human mammary gland library was screened by standard 

20 plaque hybridization technique (Maniatis, et al. (1982) 
Molecular Cloning; A Laboratory Manual) with three 
synthetic oligomers. Two of the oligomers were 30 -mere 
corresponding to the cDNA sequence of Rado et al., 
supra . at amino acid positions 436*445 and 682-691* The 

25 third was a 21-mer "best guess" probe based on human 
codon bias and coding for amino acid sequence of HLF 
between amino acid residues 18 and 24. Respectively, 
they were: 

(1) 5 1 -CTTGCTGTGGCGGTGGTTAGGAGATCAGAC— 3 1 (Seq. ID 
30 NO.: 9) 

(2) 5 • -CTCCTGGAAGCCTGTGAATTCCTCAGGAAG-3 • (Seq. ID 
No. : 10) , and 

(3) 5 ' -ACCAAGTGCTTCCAGTGGCAG-3 • ( Seq. ID No. : 11). 
The probes were radiolabeled (Grouse et al. (1983) 

35 Methods Enzvmol . 10l f 78-98) and used to screen 
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duplicate filters. Filters were washed at a final 
stringency of 2 X SSC, 37 °C. 

c yu<?lfiQti<t^ sequence Analysis 

DNA fragments were isolated by use of low-melting 
5 agarose (Crouse et al, supra ) and subcloned into 
bacteriophase M13mpl8 or Ml3mpl9 (Messing et al. (1983) 
Methods Enzymol , 101 f 20-78) . The sequence was 
determined using the Sequenase enzyme (modified T7 DNA 
polymerase) (Tabor et al. (1987) prpc. Natl, Aca<U pcj t 

10 USA 84 . 4767-4771) . All reactions were carried out 
according to the manufacturer's specifications (US 
Biochemicals) . The sequence is shown in Fig. 1. The 
hLF sequence was digested with Hindlll and EcoRI 
(present in the surrounding phage sequences) and 

15 subcloned into the Hindlll and EcoRI site of pUC19 to 
form pUS119 Lacto 4.1. This clone contained the entire 
coding sequence of the mature form of hLF, but lacked 
the complete signal sequence. 



EXAMPLE 3 

20 Construction of bovine aSl-casein C AT vectors 

In order to determine whether the aSl-casein fragments 
obtained in Example l had promoter and other properties 
needed to express a heterologous gene, expression 
plasmids were constructed containing variable amounts 

25 of 5-' and 3 '-flanking regions from the aSl-casein gene. 
The chloramphenicol Acetyl transferase gene (CAT) was 
used as a heterologous gene in these vector constructs. 
The CAT gene is useful to detect the expression level 
for a heterologous gene construct since it is not 

30 normally present in mammalian cells and confers a 
readily detectable enzymatic activity (see Gorman, C.N. , 
etal. (1983), Mol. Cell. Biol. . 2, 1044-1051) which can 
be quantified in the cells or animals containing an 
expressible gene. 
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681 bp of a aSl-casein promoter plus the first .on- 
coding exon plus approximately 150 bp of the first 
intervening sequence (TVS) were isolated from a 5'- 
5 flanking genomic clone from Example 1 by PCR 
amplification as an Ncol-Hindlll fragment (approximately 
830 bp) . This fragment is identified as fragment 1 in 
Fig. 5A. The primer sequences consisted of: 

5 • -TCCATGGGGGTCACAAAGAACTGGAC-3 • 
10 (Seq. ID No.: 12) and 

5 • -TGAAGCTTGCTAACAGTATATCATAGG-3 • 
(Seq. ID No.: 13) 

that were designed from a sequence published by Yu-Lee 
et al. (1986) Nuc. Acids Res . 1£, 1883-1902. 

15 Approximately 1.6kb (fragment 2, Fig. 5A) of aSl-casein 
3 '-flanking sequence was isolated by PCR amplification 
from a bovine 3 '-flanking genomic clone from Example l. 
This region contained the previously described splice 
within the 3 r untranslated region of aSl-casein gene. 

20 Fragment 2 was subcloned into the Smal site of pUCl9. 
The primer sequences consisted of: 

5 • — GAGGGACTCCACAGTTATGG-3 • 
(Seq. ID No.: 14) and 
5 • -GCACACAATTATTTGATATG-3 ' 
25 (Seq. ID No.: 15) 

that were designed from a sequence published by Stewart 
et al. (1984) Nucl. Acids Res . 12, 3895-3907. 

A hybrid splicing signal comprising the 3 1 splice site 
of an immunoglobulin gene (Bothwell et al. (1981) , Cell r 
30 21, 625-637) was synthetically prepared and inserted 
into pUC18 along with unique restriction sites flanking 
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either side to produce pMH-1. This plasmid is shown in 
Fig. 6. Ncol and Hindlll sites were designed such that 
ligation with fragment 1 from the bovine 5 • genomic 
clone would result in the functional hybrid splice 
5 sequence. See Fig. 11* 

A polyadenylation sequence was obtained from SV40 virus 
as a BamHI-Dral fragment (fragment 3 in Fig. 5A) 
isolated from pRSVcat (Gorman, CM., et al. (1982), 
Proc. Natl - Acad. Set.. 22./ 6777-6781). 

10 A bacterial CAT coding sequence was subcloned into pUC19 
as a Pstl-BamHI fragment. 

B. Construction of pS13'5'CAT 

Fragment I of aSl-casein promoter was subcloned into 
pMH-1 (Fig. 6) between the Ncol and Hindlll sites to 
15 form pMHSIS 1 flank. 

The SV40 polyadenylation sequence (fragment 3) was 
subcloned as a BamHI-Dral fragment into pUC19 
immediately 3 9 to the 3' aSl-casein flanking sequence 
(fragment 2) to form pUC19 3 1 UTR/SV40. This allowed 

20 for the removal of a continuous EcoRI-Sall fragment 
(containing the 3* -flanking sequence and poly (A) 
sequence) that was subcloned into pMH-1 to derive 
pMHS13 •TOR (Fig. 5B) which was used later to construct 
pMHSI 3 ' UTR hlf which contains sequences encoding human 

25 lactof errin. 

The EcoRI-Sall sequence (fragments 2 and 3} were 
subcloned into the EcoRI-Sall sites of pMHSIS • flank to 
form pS^S'flank. 



The Pstl-BamHI CAT fragment (fragment 4 in Fig. 5B) , 
30 after blunting the BamHI site with Klenow, was subcloned 
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into pS^'S 1 flank (Fig. 5B) between the PstI and Smal 
sites to form pS13 B 5 f GAT* 

C Construct ion of pSIS'CAT 

The CAT fragment (fragment 4 in Fig. 5B, Pstl-BamHI) and 
5 SV40 polyadenylation fragment (fragment 3 in Fig. 5A, 
BamHI-Dral) were subcloned into the PstI and Smal sites 
of pMHS15' flank to form pS^CAT (Fig. 5C) . 

D. Assay for CAT Production 

Each of these CAT plasmids were transf ected into human 
10 293S cells (Graham, F.L. , et al. (1977), J. <fen- ViXQl*, 
36 , 59-72) by the calcium phosphate co-precipitation 
method (Gorman, CM., et al. (1983) , Ssi§GSfi/ 221, 551; 
Graham, F.L., et al. (1973), Virology, 5Z, 456-467). 
Cells were harvested 44 hours after transfection and 
15 cell extracts were assayed for CAT activity (Gorman, 

CM., et al. (1982), Mol. Cell. PjoXw 2, 1011; 

deCrombrugghe, B., et al. (1973), Nature [London], 241, 
237-251, as modified by Nordeen, S.K., et al. (1987), 
DNA . £, 173-178). A control plasmid expressing CAT 
20 driven by the Cytomegalovirus Immediate early promoter 
(Boshart, M. , et al. (1985), £sll, 41, 521) was 
transf ected into human 293 S cells to assay for 
transf ected efficiency. 

pS13 , 5 l CAT was expressed in these cells at a level which 
25 was approximately 30-100 fold lower than the control 
plasmid, but significantly higher than background. 
Primer extension analysis indicated that transcription 
had initiated predominantly in the expected region. 



When pSlS'CAT was transf ected into 293S cells, 
30 expression was also detected. 
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Bovine as 1-casein/ human lactoferrin 

A. Construction of DNA Sequences. 

5 16kb of bovine aSl— casein 5 '-flanking sequence from 
Example 1 was isolated from the bovine genomic library 
(phage GP1) as a Sall-Bglll fragment. The BglXI site 
lies at the junction of the first intron and second exon 
of the aSl-casein gene. 

10 Bovine aSl-casein signal sequence (Stewart et al. (1984) 
Nucl. Acids Res . 12 P 3895) was prepared from synthetic 
DNA synthesized on a Cylone Plus® DNA Synthesizer 
(Hillgen/Biosearch X) and contained the entire signal 
sequence plus Xhol and Cla I sites attached to the 

15 5 f -end r and Nael to the 3 '-end (fragment 8 r Fig. 7B) . 

Cleavage of pUC119 Lacto 4.1 with Eael precisely opened 
the plasmid at the codon for the first amino acid of 
mature hLF. Treatment with Klenow was used to fill in 
the overhanging 5 »-end. Further digestion with AccI and 
20 EcoRI gave two fragments: (a) an Eael-AccI fragment 
containing the first 243 bp of mature hLF (fragment 5, 
Fig. 7C) , and (b) a contiguous AccI -EcoRI fragment 
(fragment 6, Fig. 7C) of 1815 bp that contained all but 
five terminal codons of the remaining coding sequence. 

25 A synthetic linker was prepared that contained the last 
five codons of hLF beginning at the EcoRI site and 
extending for four bases beyond the stop codon. A Kpnl 
site was added to the 3' -end (fragment 7 in Fig. 7C) . 

An 8 . 5kb EcoRI 3 ' -fragment was isolated from the bovine 
30 genomic library (Fig. 4) containing sequences beginning 
just downstream of the coding region of a S 1-casein and 
a BstEII site approximately 350 bp from the 5 f -end. 
This fragment was subcloned into pMH-1 at the EcoRI site 
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to form pMH3'E10 {Fig. 7A) . A Sail site is adjacent to 
the 3«-EcoRI site in pMH3»E10. 

B. construction of CGPIHLF 

The hLF 3 '-linker (fragment 7, Fig. 7C) was subcloned 
5 into the EcoRI-Kpnl sites of pMH3'UTR (Fig. 7A) to 
produce pMH3 •OTRhLF2 linker (Fig. 7A) . 

The synthetic bovine oSl-casein signal sequence 
(fragment 8) was then subcloned into the Xhol and Smal 
sites of pMH3 * DTRhLF2 linker to make pS13'hLFl/2L (Fig. 
10 7B) . 

The two hLF coding fragments (fragments 5 and 6 in Fig. 
7C) were subcloned into the Nae^' and EcoRI sites of 
pS13*hLFl/2L (Fig. 7B) to make pS13'UTRhLF (Fig. 7C) . 

The large crSl-casein 3'UTR fragment from pMH3«E10 (Fig. 
15 7A) was isolated as a BstEII-Sall fragment and subcloned 
into the same sites of pS13»UTRhLF to form phLF3'10kb 
(Fig. 7D). 

Cosmid cGPlHLF was prepared from a 3-way ligation (Fig. 
7F) : 

20 (1) the 16kb 5' -flanking sequence from phage 

GP1 (Example 1, Fig. 3) was modified by attaching two 
linker adapters. The Sail site at the 5 '-end was 
ligated to a Notl-Sall linker. The Bglll site at the 
3 '-end was ligated to a Bglll-Xhol linker; 

25 (2 ) the hLF coding region, flanked on the 5 • -end 

by the crSl-casein signal sequence and on the 3' -end 
by approximately 8.5kb of oSl-casein 3' -flanking 
sequence, was isolated as a Xhol-Sall fragment from 
phLF3'10kb. The Sail site at the 5 '-end was ligated 

30 to a Sall-NotI linker; 
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(c) Cosmid pWE15 (Stratagene, Inc.) was 
linearized with Not I. 

Fragments from (a) , (b) , and (c) were ligated together 
and transfected into bacteria using commercial lambda 
5 packaging extracts (Stratagene, Inc.) to produce 
cGPlHLF. 

EXAMPLE 5 

Bovine aSl-casein/hUF expression plasmids. 

A. Construction of pS13'5'hLF 

10 The Hindlll-Sall fragment of pS13'UTRhLF was subcloned 
into the same sites in pMHS15 • flank to form pSlB^hLF 
(Fig. 7E) . This plasmid contains 681 bp of bovine 
aSl-casein promoter sequence, the a&i-casein/lgG hybrid 
intron, the aSl-casein signal sequence, the hLF coding 

15 region, approximately 1.6kb of aSl-casein 3 9 -flanking 
sequence, and the SV40 late region polyadenylation 
sequence. 

B. pSlS'hLf 

Plasmid pS13 'S'liLF (Fig. 7E) was cut with Kpnl and BamHI 
20 which border the aSl-casein 1.6kb 3 '-flanking sequence. 
The larger vector fragment was purified, made blunt 
ended with Klenow, and self-ligated to form pS15 v hLF. 

C. RadjQifflmunQ^g^^Y t9V W? 

An immunoglobulin-enriched fraction of ascites fluid of 
25 a monoclonal antibody against human lactoferrin, which 
does not cross-react with the bovine or murine protein, 
was prepared by 50% ammonium sulfate precipitation and 
coupled to CNBr-activated Sepharose 4B ( 20 mg of 
protein to 1 g of Sepharose) . The Sepharose beads were 
30 suspended (2 mg/ml) in phosphate-buffered saline (PBS; 
10 mM sodium phosphate, 0.14 M NaCl containing 10 mM 
EDTA, 0.1% ( w /v) Polylorene and 0.02% ( w /v) NaN 3 , pH 7.4. 
Sepharose suspensions (0.3 ml) were incubated for five 
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hours at room temperature by head-over-head rotation 
with samples (usually 50/il) in 2 -ml polystyrene tubes. 
Sepharose beads were then washed with saline (five times 
with 1.5 ml) and incubated for 16 hours at room 
5 temperature with 50/il (lkBq) of I25 I-labeled-af f inity- 
purified polyclonal rabbit anti human lactoferrin 
antibodies, together with 0.5 ml of PBS, 0.1% ( w /v) 
Tween-20. Thereafter the Sepharose was washed again 
with saline (four times with 1.5 ml) and bound radio 

10 activity was measured. Results were expressed as 
percent binding of the labelled antibodies added. 
Levels of lactoferrin in test samples were expressed in 
nanomolar, using purified human milk lactoferrin as a 
standard (serial dilutions in PBS, 10 mM EDTA, 0.1% ( w /v) 

15 ¥ween-20. ' " 

Repeated testing of standard on separate occasions 
revealed that this RiA was highly reproducible, intra- 
and inter assay coefficients of variation ranged from 
5-10%* As little as 0.1 nanogram human lactoferrin is 
20 easily detected by this RIA. 

D. Eaqaye^lon in pellg 

293S cells were transfected with the above hLF plasmids 
as described (1/ig of a CMV-CAT plasmid was co- 
transfected as control for transf ection efficiency) . 
25 Forty-f our hours after transf ection medium was removed 
from the cells and assayed for hLF as described supra . 
RNA was isolated as described by Stryker, et al. (1989) 
EMBO J. 2669. The results can be summarized as 
follows: 

30 1. Transf ection efficiencies are identical for the 
two hLF plasmids; 

2. hLF is expressed in the cells and secreted into 
the medium. In both cases, the levels are about 0.4jug/ml 
medium using about 3x 10* cells 
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3. The proteins behave identical to hLF in a human 
milk sample in a dose response assaymeasuring the amount 
of 125 I- anti-lactoferrin bound as a function of the 
amount of sample used. 
5 4. The protein has about the same size (~80kD) as in 
a human milk sample as judged by Western blotting. 

5. The hLF ENA produced in the cells has the correct 
size and its level is similar for both plasmids as 
judged by Northern - blotting. 

10 These data indicate that these two expression plasmids 
are able to express hLF. By all standards used so far, 
the protein is identical to hLF present in human milk. 
The heterologous signal sequence is functional in that 
1 £ it promotes secretion of the protein from the cells into 
15 the medium. Further , the casein regulatory sequences 
used in these plasmids are able to promote expression 
of a heterologous gene. 

EXAMPLE 6 

In vitro Maturation, Fertilization 
20 and Culture of Bovine Oocytes — 

Immature oocytes are obtained in large quantity 
(400-600/day) by aspirating follicles of ovaries 
obtained at abbatoirs. Immature oocytes are cultured 
25 for a period in vitro before they are competent to be 
fertilized. Once "matured", oocytes are fertilized with 
sperm which has also been matured, or "capacitated" in 
vitro . The pronuclei of the fertilized oocyte is then 
injected with the transgene encoding for the expression 
30 and secretion of human lactoferrin. Zygotes resulting 
from this in vitro fertilization and microinjection are 
then cultured to the late morula or blastocyst stage 
(5-6 days) in medium prepared, or "conditioned 11 by 
oviductal tissue. Blastocysts are then transferred non- 
35 surgically to recipient cattle for the balance of 
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gestation or analyzed for integration of the transgene 
as described herein. 

In vitro maturation £1311. Ovaries are obtained 

immediately after slaughter at local abbatoirs and 
5 oocytes are recovered. Alternatively, oocytes are 
obtained from living cattle by surgical, endoscopic, or 
transvaginal ultrasonic approaches. In all cases, 
oocytes are aspirated from ovarian follicles (2-10 mm 
diameter) . After washing, oocytes are placed in a 
10 maturation medium consisting of H199 supplemented with 
10% fetal calf serum, and incubated for 24 hours at 
39«C. Sirard et al. (1988) Biol. Renrod. 29, 546-552. 

■S-i .»v. 

In vitro fertilization (TVF\ . Matured oocytes^ are 
fertilized with either fresh or thawed sperm. Sperm are 

15 prepared for fertilization by first obtaining a 
population of sperm enriched for motility by a "swim-up 11 
separation technique (Parrish et al. (1986) 
Therioaenology 25 . 591-600) • Motil sperm are then added 
to a fertilization media, consisting of a modified 

20 Tyrode's solution (Parrish et al. (1986) supra .) 
supplemented with heparin to induce sperm capacitation 
(Parrish et al. (1988) Biol. Reorod . 3£, 1171-1180). 
Capacitation constitutes the final sperm maturation 
process which is essential for fertilization. Sperm and 

25 oocytes are co-cultured for 18 hours. A useful feature 
of this IVF method is that (in the case of frozen sperm) 
consistent, repeatable results are obtained once optimal 
fertilization conditions for a particular ejaculate have 
been defined (Parrish et al. (1986) supra .) . 

30 In vitro culture ( IVO . Conventional culture systems, 
which support development of murine, rabbit, or human 
ova, do not support development of bovine embryos past 
the 8-16 cell stage. This problem has been overcome by 
pre-conditioning culture media with oviductal tissue. 
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Oviduct-conditioned medium will support bovine embryos 
past the 8-16 cell stage to the blastocyst stage in 
vitro (Eyestone and First (1989) J. Renrod, Fert. 
715-720) . 

5 Bovine embryos have proved refractory to in vfoyQ 
culture. This in part stems from the existence of a 
"block" to cleavage in vitro at the 8-16 cell stage. 
This block may be alleviated by culturing embryos in the 
oviducts of rabbits (reviewed by Boland (1984) 
10 T^ rioqenoloqY 21r 126-137) or sheep (Willadeen (1982) 
in: Mammalian Egg Transfer, (E. Adams, ed. , 
pp. 185-210) ) ; Eyestone et al. (1987) Therioaenoloov 28 , 
1-7). However, these in vivq alternatives have been 
less than ideal, in that: (1) they require the 
15 maintenance of large numbers of recipient animals, 
(2) they require surgery to gain access to the oviducts 
for transfer, and a second surgery (or sacrifice) to 
recover the embryos, (3) all transferred embryos are 
seldom recovered, and (4) access to embryos during 
20 culture for observation or treatment is entirely 
precluded. The lack of in vitro culture systems has 
hampered the development of various manipulation 
techniques (such as gene transfer by pronuclear 
injection) by preventing accumulation of basic 
25 information of the chronology and ontogeny of bovine 
development, and by complicating the process of 
culturing embryos to a stage compatible with non- 
surgical embryo transfer and cryopreservation techniques 
(e.g., late blastocyst stages). 

30 Bovine embryos did not yield to attempts to culture them 
in vitro past the 8-16 cell "block" until Camous et al. 
(1984) J- Renrod. Fert . 2£, 479-485 demonstrated 
cleavage to 216 cells when embryos were co-cultured with 
trophoblastic tissue. 
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The co-culture procedure was extended to oviductal 
tissue, based on the ability of homo- or hetero-oviducts 
to support development from zygote to blastocyst. Thus, 
bovine embryos co-cultured with oviductal tissue, or in 
5 medium conditioned by oviductal tissue, developed from 
zygote to blastocyst in vitro (Eyes tone and First, 
(1989) J- Reorod. Pert . 715-720; Eyestone W.H. 

(1989) "Factors affecting the development of early 
bovine embryos in vivo and in vitro . H Ph.D. Thesis, 

10 University of Wisconsin) . Blastocysts have been 
produced in this system after superovulation and 
artificial insemination, or by in vitro maturation 
(IVM), and fertilization (IVF) of immature oocytes. 
Blastocysts produced in this fashion resulted in 

15 pregnancies and live calves after transfer to recipient 
animals. The results obtained were as follows: 



Efficiency Number 

Step (*) (pgr 1QQ) 

IVM 90 90 

20 IVF 80 72 

IVC 30 22 

Embryo transfer 50 11 
(% pregnant) 



Therefore, from an initial daily harvest of 500 oocytes, 
25 it is expected the approximately 55 pregnancies will 
result. 

Preparation of Oviduct Tissue 
Co-Culture and Conditio ned Medium 

1. obtain bovine oviducts after slaughter or by 
3 0 salpingectomy • 

2. Harvest lumenal tissue by scraping intact 
oviduct gently with a glass slide. 
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3 . Wash tissue 5 times in 10 ml modified tyrodes- 
hepes solution (Parrish et al. (1988) Biol, Reorod . 18, 
1171-1180) . 

4. Resuspend final tissue pellet in Ml 9 9 + 10% 
5 fetal calf serum at a ratio of 1 volume tissue: 50 

volumes of media. 

5. Tissue suspension can be used for embryo-co- 
culture - 

6. Alternatively, media may be conditioned for 
10 48h; after centrifuging the suspension, the supernatant 

may be used as embryo culture medium* Conditioned 
medium may be stored at -70 °C # if desired. Conditioned 
medium should be used at full strength for embryo 
culture (no dilution) (Eyestone (1989) jM£) - 

EXAMPLE 7 

Microinjection of hLF Transoene into Bovine Pronuclei 
The DNA fragment containing the hLF expression unit is 
excised from the vector by digestion with the 
appropriate restriction enzyme (s) and separated on 
agarose gels. The fragment is purified by 
electroelution, phenol and chloroform extraction and 
ethanol precipitation (Maniatis et al.). The DNA 
fragment is dissolved in and dialyzed in 10 mM tris, 
0.1 mM EDTA pH 7.2 at a concentration of 1 to 2/ig/ml. 
Microinjection needles are filled with the dialyzed DNA 
solution. 

Before in vitro fertilization, cumulus cells are removed 
from the egg by either vortexing at maximal speed for 
2 minutes or pipetting the eggs up and down several 
30 times in a standard micropipet. Bovine pronuclei are 
injected in principle as murine pronuclei (Hogan, B. et 
al. (1986) in: Manipulating the mouse embryo. Cold 
Spring Harbor Laboratory) with an additional 
centrifugation step in order to visualize the pronuclei. 



15 



20 



25 



The injection takes place 18-24 hours after 
fertilization. The time varies depending on the bull 
used as a source of semen. Different batches of semen 
cause the nuclei to become visible at different times. 

5 Bovine oocytes, matured and fertilized in vjtVQ, are 
spun in an eppendorf tube in l ml of tyrodes-hepes 
solution (Parrish (1987)) at 14500 g for eight minutes 
(Wall et al. (1985) Reorod. 22, 645-651). The 

embryos are transferred to a drop of tyrodes-hepes 

10 solution on a microscope slide covered with paraffin 
oil. Using a hydraulic system the oocytes are fixed to 
the egg holder in such a way that both the pronuclei are 
visible (using interference-contrast or phase contrast 
optics) . If necessary, the oocytes are rolled to change 

15 their position on the egg holder to visualize the 
pronuclei. The injection needle is brought into the 
same sharp focus of one of the pronuclei. The needle 
is then advanced through the zona pellucida, cytoplasm 
into the pronucleus. A small volume of 1-3 pi is 

20 injected (containing 20-100 DNA copies) into the 
pronucleus either by using a constant flow or a pulse 
flow (using a switch) of DNA solution out of the needle. 
Alternatively, two cell stage embryos are spun as 
described and the nuclei of both blastomers are injected 

25 as described. The injected embryos are then transferred 
to a drop of co-culture medium as described in Example 6 
in order to develop to the morula or blastocyst stage. 



rex AMPLE 8 

ffa rlY n? teGtiow nf Tran soenesis with hLF Transqene 
30 Upon the microinjection of a construct, the oocyte is 
cultured. A proper site of each embryo is cleaved and 
subjected to lysis (King, D. et al. (1988) Molecular 
production , Pevaloiraent 1, 57-62), proteolysis 

(Higuchi, R. , (1989) -Amplifications (A forum for PCR 
35 Users." 2., 1-3) and DPNI digestion. PCR is performed 
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as described previously (Ninomiy, T- et al. (1979) 
Molecular Reprod. and Devel . i, 242-248) with sets of 
two primers, one in aSl and the other in hLF cDKA 
sequence. For example, in a PCR where the forward 
5 primer (30mer) aSl sequence is 

ATG AAA CTT ATC CTC ACC TGT CTT GTG 
(Seq. ID No.: 16) 
and the reverse primer (30mer) in hLF sequence is GGG TTT 
TCG AGG GTG CCC CCG AGG ATG GAT (Seq. ID No. : 17) ; 971- 
10 1000 of Figure 1) , a 990 bp fragment will be generated. 
This fragment contains the hitherto inactivated DpNI 
site by loss of adenosine-methylation, at 934 bp away 
from the start of the forward primer. 

.j .... 

15 Production of hLF in Milk of Bovine Species 

Bovine morula developed from microinjected oocytes are 
split according to the method of Donahue (Donahue, S. 

(1986) ggnsttc Engineering Qf ftnjlmalq, ed. j. warren 

Evans et al., Plenum) • One half of the morula is kept 
20 in culture to develop into blastocysts. The other half 
is subjected to the DNA analysis as described in 
Example 8. When the result of this analysis is known, 
the morula kept in culture are developed into a 
blastocyst or as a source for nuclear transfer into 
25 enucleated zygotes. Blastocyst transfer into 

synchronized cows is performed according to the method 
of Betteridge (Betteridge, K.J. (1977) in: Embryo 
transfer in farm animals: a review of techniques and 
applications) • 

30 hLF is detected in the milk of lactating transgenic 
offspring using the R1A of Example 5. 
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Three overlapping phage clones that contain the complete 
hSA gene are used to construct an expression vector for 
5 hSA. They are designated XHAL-HA1, XHAL-3W and XHAL- 
H14. They are described in Urano, et al« (1986), 
Biol. Chem. . 261 . 3244-3251; and Urano / et al. (1984) , 
Gene . 2Zr 255-261. The sequence of the gene plus some 
surrounding regions is published in Minghetti, et al* 
10 (1986), J. Biol. Chem . . Z&l, 6747-6757. A single phage 
containing the complete hSA gene is constructed as 
follows: 

Clone HA-1 is cut with BstEII and Ahall. The *1400 bp 
fragment running from position 1784 (in the first exon, 

15 just downstream of the ATG) to 3181 is isolated and a 
synthetic linker is attached to the BstEII site at the 
5 1 end containing the first few amino acids that are cut 
off with BstEII as well as the sequence surrounding the 
ATG as well as a few convenient restriction sites. This 

20 fragment is called fragment #1. 

Clone 3W is cut with Ahall and SacI the »13 . Ucb fragment 
running from position 3181 to 16322 is isolated and a 
synthetic linker is attached to the SacI site to 
facilitate cloning in phage EMBL3. This fragment is 
25 called fragment #2. 

These two fragments are ligated and cloned in phage 
EMBL3. After identification of the correct phage, a 
fragment running from just upstream of the BstEII site 
(where unique restriction sites have been introduced) 
30 to the SacI site are isolated and ligated from a SacI 
to Sail fragment (running from position 16322 to «21200 
isolated from clone H-14 . These two fragments are then 
ligated and cloned in EMBL4. 
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After cutting with Clal (just upstream of the BstEII 
site, newly introduced) and BamHI (just downstream of 
the Sail site in the phage DNA) this new clone yields 
a fragment containing the complete hSA gene with about 
5 2.5kb 3 '-flanking sequence. 

To construct an expression vector for hSA cosmid cGFlHLF 
is partially digested with Clal and BamHI . This removes 
the signal sequence, the coding sequence of hLF, the 3 
DTR and poly (A) addition region of orSl-casein as well 
10 as a small region 3' of the casein gene. 

This is ligated to the hSA fragment described above and 
the resulting cosmid is called cGPlHSA. 

The expression vector so formed contains, (1) 16kb of 
promoter sequences derived from the otSl-casein gene, 
15 (2) the first exon and intervening sequence of this gene 
both present in GP1, (3) the signal sequence of the hSA 
gene the complete genomic gene coding for hSA including 
2.5kb downstream of that gene, and (4) «8Jcb of 
3 '-flanking sequence derived from the aSl-casein gene. 

20 This transgene is used to produce transgenic bovine 
species producing hSA in their milk in a manner 
analogous to that used to produce hLF in the milk of 
bovine species* 

PWK IX 

25 Purification of HSA from the Milk of Bovine Species 
Purification of heterologous proteins from milk is 
facilitated by the fact that, following casein 
precipitation, those proteins, for the most part, are 
found in the whey fraction which is less contaminated 

30 than the production media used in microbial or cell- 
based systems. 
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Chromatographic techniques are preferred for the 
purification of hSA from cow milk. This approach 
produces a better recovery and higher albumin purity as 
well as a lower content of albumin polymers as compared 
5 with ethanol fractionation (Curling (1980) in: "Methods 
of Plasma Protein Fractionation", Curling, ed. , Academic 
Press London, UK; Curling et al. (1982) J. Parenteral 
Sci. yechnol . 2£, 59; Berglof et al. and Martinache et 
al. (1982) Joint Meeting IHS-ISBT, Budapest) . The 
10 specific transport role of hSA as well as its major role 
in maintaining intravascular osmotic pressure may also 
be better preserved upon chromatographic purification 
(Steinbruch (1982) , Joint Meeting ISH-ISBT, Budapest) . 

The following steps are used to recoveir^hS A produced in 
15 the milk of transgenic cows: 

1. Precipitation of caseins (about 80% of milk 
protein) and essentially all the milk fat at pH 4,5 
and/or by adding chymosin. The whey fraction 
contains the albumin; 

20 2* Affinity-chromatography of albumin on Cibacron blue 
36A-Sepharose CL-6B (Harvey (1980) in; Methods 
of Plasma Protein Fractionation, op. cit.) This 
step serves both to remove proteins other than 
albumin and to decrease the volume to be handled 

25 about 30-fold. Albumin is eluted from this matrix 

with 0.15 M NaCl and 20 mM sodium salicylate at 
pH 7.5; 

3. Buffer-exchange on Sephadex G-25: desalting into 
0.025 M sodium acetate, adjustment to pH 5.2, 

30 followed by filtration; 

4. Anion-exchange chromatography on DEAE-Sepharose 
CL-6B. Desorption of albumin at pH 4.5; 

5 . Cation-exchange chromatography on CM-Sepharose CL- 
6B. Albumin elution with 0.11 M sodium acetate. 
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pH 5.5 and concentration of albumin at a 6% (w/v) 

solution by ultrafiltration; and 
6. Gel filtration on Sephacryl S-200. Fraction of 

high-molecular weight protein (e.g. albumin 
5 polymers, pyrogens) is discarded. The main 

fraction (albumin monomers) is concentrated by 

ultrafiltration and formulated. 
It is to be noted that steps 3-6 are essentially 
identical to the method described by Curling and others 
10 (Curling (1980) op. cit.; Curling et al. (1982) op. 
cit.; Bergl»f et al. (1982) op. cit.) for the 
purification of hSA from plasma. 



EXAMPLE U . 

Transgenic Mice Containing the Human 
15 Serum Albumin (hSA) Transgene 

Generated bv H omologous Recombination 

Three overlapping genomic hSA clones were used to 
generate the hSA gene in transgenic mice, AHAL-HA1, 
XHAL-H14 and XHAL-3W, are shown in Figure 8 as reported 

20 by Urano, et al. (1984), Gene, 32/ 255-261 and Drano, 
et al. (1986), J, Biol. Chem. . 261 3244-3251. Briefly, 
a genomic library was constructed from a partial EcoRI 
digest of human fibroblast DNA. For the clones XHAL-H14 
and XHAL-3W, this library was screened with ^P-labeled 

25 human albumin genomic clones by hybridization in 1 M 
NaCl, 50 mM Tris-HCl (pH 8.0) , 10 mM EDTA, 0.1% SDS, 100 
ug/ml of sheared salmon sperm DNA and lOx Denhardt's 
solution at 65° C overnight after prehybridization in 
3x SSC and lOx Denhard^s solution. Following 

30 hybridization, filters were washed in 0.2x SSC and 0.1% 
SDS at 65° C. The isolation of the XHAL-HA1 clone was 
identical except that a 0 . 9 kb Bglll-EcoRI fragment from 
the 5 V end of XHAL-3W was used to screen the human 
fibroblast library. 
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These three hSA phage clones were used to generate three 
overlapping linear DNA fragments, which in composite 
comprised the whole HSA gene and flanking regions. The 
5 9 most fragment I was a EcoRI-EcoRI fragment isolated 
5 from XHAL-HA1 ; the middle fragment II was a Acyl 
(=AhaII) -Sad fragment of AHAL-3W; and the 3 • most 
fragment III was a Xhol-Sall fragment of XHAL-H14 
(Fig. 7) . The fragments were treated with klenow DNA 
polymerase and dNTP*s to fill in overhanging sticky 

10 ends. In some experiments , the blunt ended fragments 
were then treated with bacterial alkaline phosphatase 
to remove the 5* phosphate groups from each fragment. 
The overlapping DNA fragments were next concentrated 
then co injected into the male pronuclei of fertilized 

15 mouse eggs, according to published methods (Hogan, et al. 
(1986) in "Manipulating the Mouse Embryo: A Laboratory 
Manual* 1 , Cold Spring Harbor Laboratory) . While the 
number of molecules injected varied from «25 to «100 per 
egg cell, the ratio of the individual fragments was 

20 approximately 1:1:1. Embryos were implanted into the 
uteri of pseudo pregnant female mice according to the 
methods of Hogan, et al., supra . 

To assay correct homologous recombination of the three 
overlapping fragments and integration of the nascent 
25 transgene into the mouse genome, genomic DNA from the 
newborn pups was subject to the following specific 
digestions followed by Southern hybridization with HSA 
cDNA probes: 

Bst EI I: cuts outside the HSA gene region and yields an 
30 18 kb band if correct recombination occurred; 

Nco I: cuts outside the overlapping regions and yields 
bands of 8.0 and 9.3kb if correct recombination 
occurred; 
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Nco I + Hind III: cuts at several positions outside the 
region of overlap, indicative of the presence of intact 
fragments; 

Hinc II: cuts in the overlapping regions, yielding 
5 several bands indicative of correct arrangement in these 
regions. 

In an initial experiment of 28 transgenic animals born, 
22 had correctly recombined all three fragments. Prom 
20 out of those 22 animals blood was collected and 

10 assayed for the presence of hSA protein using a radio 
immuno assay. 15 out of those 20 animals showed hSA 
expression at levels between .0.5 and 5 pg/mL. None of 
the animals that had no recombination or that were not 
transgenic showed any expression. Using RNA blots, only 

15 two (the two with the highest protein level) showed a 
band. We are currently performing blots on RNA that has 
been enriched for the presence of mRNA (i.e., poly (A) + 
RNA) . Using reverse transcriptase to synthesize cDNA, 
followed by PCR, we have observed a perfect relationship 

20 between the presence of RNA and protein. However, in 
this experiment we could not determine the size(s) of 
the RNA. 

EXMfffcP 13 

Alternate Co nstruction of Transaenes Encoding hLF 
25 This example describes the construction of two hLF 
transgenes wherein the first contains approximately 16kb 
of aSl-casein 5 • expression regulation sequence (pGPlhLF 
(16kb) also referred to as p!6,8HLF4) and the second 
contains approximately 7 . 9fcb of aSl-casein 5 • expression 
30 regulation sequence (pGPlhLF (8kb) also referred to as 
p8*8HLF4) . The overall strategy for these constructions 
is depicted in Fig. 9. 
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A 1.8kb EcoRI-Bglll fragment (fragment C in Fig. 9) was 
isolated from phage clone GP1. This fragment runs from 
position -100 of the transcription start site into the 
second exon of the aSl-casein gene. The Bglll site lies 
5 at the junction of the first entron and second exon of 
the aSl-casein gene. The 3* end containing the Bglll 
site was ligated to a synthetic Bglll-Clal linker and 
subcloned into the plasmid pUC19 . The resulting plasmid 
is designated pEBS. 

10 Fragment B in Fig. 9 was isolated as an EcoRI fragment 
and cloned into the EcoRI site of pEBS. Fragment B 
includes sequences from position -7500 to position -100 
of the transcription start site in the aSl-casein gene. 
The plasmid so formed is designated, pEB3S~and contains 

15 the combination of fragments B and C is the 8 . 9kb EcoRI - 
Clal fragment running from position -7500 to position 
+1400 of the transcription start site. The 8.9kb EcoRI- 
Clal fragment from pEB3 , obtained by complete digestion 
with Clal and partial digestion with EcoRI was isolated 

20 and subcloned into EcoRI-Clal cut pKUN2 ( a derivative 
of pKON; Gene (1986) 269-276 containing a NotI 

restriction site) to form pNE3BS. 

An 8.5kb Clal-EcoRI fragment (fragment A in Fig. 9) 
running from position -16000 to position -7500 of the 
25 transcription start site was isolated from phage GP1. 
It was thereafter subcloned into ptJC19 to form pSE. 
Using synthetic oligonucleotide, a unique NotI site was 
introduced into the Clal site thereby destroying it. 
The resulting plasmid is designated pNE. 

30 The insert from pNE was isolated as a Notl-EcoRI 
fragment and together with the EcoRI-Clal insert from 
pNE3BS was ligated into the cloning vector pK0N2. The 
resulting plasmid pGPl (A2ex) contains 16kb of oSl- 



WO 91/08216 



-74- 



PCT/US90/06874 



casein promoter plus the 5 ' end of the gene to the Bglll 
site at the border of the second exon. 

The final plasmid (16,8HLF4) containing the transgene 
was assembled using the Notl-Clal fragment from clone 
5 pGPI (A2ex) and the Xho-NotI fragment from clone pHLF 
3' lOkb. The structure of this transgene is the same 
as previously described herein. 

As a minor modification to this plasmid the Sail site 
of this plasmid was removed by cutting with Sail and 
10 inserting a linker that contains a NotI site, but not 
a Sail site. Subsequently, a Sail site was introduced 
just downstream of the hLF sequence by cutting the Kpnl 
site at that position adding the following linker: 

5 1 -CGTCGACAGTAC-3 * (Seq. ID No.: 18) 
15 CATGGCAGCTGT— 5 • (Seq. ID No.: 19) 

In effect, the hLF sequence is now surrounded by two 
unique restriction sites (Clal and Sail) and can be 
replaced by any recombinant ANA sequence that has a 
Clal-site at the 5 1 - end and a Sail-site at the 3 1 - end. 

20 Another transgene was constructed that is identical to 
the foregoing except that it contains only about 8kb of 
5' aSl-casein expression regulation sequence. It was 
constructed by taking the Notl-Clal fragment from pNE3BS 
and fusing it directly into Xho-NotI fragment from clone 

25 pHLF 3' lOkb. The resulting plasmid was designated 
pGPIhLF (7kb) (also referred to as p8.8HLF4). 

Plasmid 16,8hLF4 was modified to contain a hybrid splice 
signal (otSl-casein-IgG) described in examples 3 and 5. 
The resulting plasmid was designated 16,8hLF3 and is 
30 identical to 16,8hLF4 except for the presence of a 
hybrid intron versus a "natural" casein intron in the 
5»-UTR. 
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The hLF signal sequence can also be used in all of the 
cDNA constructs disclosed herein instead of the casein 
signal sequence. This can be done in the following way: 
A synthetic oligo was made that contains the complete 
5 hLF signal sequence (see Fig. 2) plus a Clal restriction 
site at the 5 '-end and an EagI restriction site at the 
3 '-end. These restriction sites also border the casein- 
signal sequence in the other plasmids (e.g. , pl6,8hLF4) • 
A fragment containing the hLF-cDNA surrounded by Clal 

10 and Sail sites was cloned in pGEM7 (Stratagene, Inc.) 
containing a Clal and Sail site. The resulting plasmid 
was digested with Clal and EagI and used as a vector to 
accommodate the Clal-EagI fragment containing the hLF 
sequence. From the positive clones, the cDNA, with its 

15 own sequence, was excised as a Clal-Sall fragment and 
inserted in Clal-Sall digested pl6,8hLF4 to generate 
pl6,8hLF5. Similarly, this Cla-Sal fragment containing 
the hLF-cDNA plus hLF signal sequence can be inserted 
in any hLF cDNA vector. 



20 EXfiMPftE 14 

Production of Recombinant Human Lactoferin 
in the MUK of Tranggepj-g Mi<re 

Transgenic mice were generated utilizing several of the 
transgenes identified in the examples herein. The 

25 transgenes used are identified in Table 3. In each 
case, the 5' and 3' expression regulation sequences were 
from the bovine aSl-casein gene, the RNA splice signal 
in the 5 ' untranslated region was either homologous from 
the aSl-casein gene or a hybrid casein-IgG intervening 

30 sequence. The recombinant DNA in each case was derived 
from cDNA clones. 
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10 EXAMPLE 15 

Construction of Transcrene Cassette 
for Genomic RfrCOTjfoinant DNA 

The plasmids described so fear all contain regions 
derived from the bovine otSl-casein untranscribed regions 

15 (including intervening sequences) . When a genomic gene 
is to be expressed that already contains untranslated 
regions and intervening sequences permissive for high 
expression, it is preferable to use expression cassettes 
where the flanking regions of the aSl-casein gene are 

20 operably linked to the untranslated regions of the gene 
to be expressed. Such an expression cassette is p- 
16kb,CS and was constructed as follows: plasmid pSl 
3'5'hI*F was used as a template in a PCR experiment. 
This plasmid contains 680 bp of promoter sequence of the 

25 aSl-casein gene as well as its first exon. the rest of 
this plasmid is not relevant for this experiment. The 
upstream primer was located just upstream of the insert 
in the plasmid moiety (just upstream of a NotI 
restriction site) . Its sequence is: S^CGA CGT TGT AAA 

30 ACG ACGG-3" . 

The downstream primer was located in exon 1. Its 
sequence matches the first 19 bp of the exon exactly and 
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also has a non-hydridizing region of 17 bp containing 
a Clal and a Sail site. It has the following sequence: 
5 • -ATTGTCGACTTATCGATGGGTTGATGATCAAGGTGA-3 • 

The amplified fragment was digested with NotI and Sail 
5 and ligated into pKUN2 (see Example 13) . The resulting 
plasmid (p-680CS) therefore harbors a proximal promoter 
fragment from -680 to + 19, plus two restriction sites 
just downstream of those 19 bp. 

This plasmid was digested with NotI (just upstream of - 
10 680) and Nsil (at-280) and used as a vector to ligate 
to a fragment running from a NotI site (just upstream 
of -16kb) to Nsil (-280) isolated -s from pl6,8hLF4 
(Example 13). This plasmid (p-16kb,CS) therefore 
harbors a promoter fragment from »-16,000 to +19. It 
15 can be used to insert genomic genes that carry their own 
OTR's and poly (A) -signal. After insertion of the 
genomic gene as a Clal-Sall fragment, the aSl-casein 3 •- 
flanking region can be inserted as a Sail-fragment. 

FYAMPT.E 16 

20 Construction of T r anfitrane fnr Production of Protein Q 
The genomic sequence of Protein C has been published. 
Foster, et al. (1985) proc. Nat]. Acad. Sci, USh S2., 
4673-4677. This sequence, however, does not include the 
first exon which was identified through the cDNA 

25 sequence published by Beckman, et al. (1985) Nucl, Acids 
Res. 12, 5233-5247. The first exon of Protein C is 
located at position -1499 to -1448 in the Foster 
sequence. The transgene for expressing and secreting 
Protein C into the milk of bovine species is shown in 

30 Fig. 10. This transgene was constructed as follows. 

A human genomic library in EMBL-3 (Clonotech) is probed 
with a sequence specific for protein C. A purified 
phage DNA prep containing the complete Protein C gene 
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is isolated. The phage is isolated from an JL. sslL 
strain having the Dam phenotype, such a strain GM113. 
This results in cloned DNA which is not methylated and 
as such all Clal restriction sites can be cleaved. 

5 A Clal Nhel fragment running from positions +1333 to 
11483 is isolated. This is designated fragment I. 

pGEM7 (Stratogene, Inc.) is digested with SphI and Smal. 
The region in between is replaced by the corresponding 
region of plasmid pKON (Gene (1986) 4£, 269-276) . The 
10 resulting plasmid is designated pGEM7A and has the 
following restriction map in the relevant region: 

Hindi-XT > uncial Xbal Sail Spel - 



15 Two primers are synthesized. Primer GP125 has the 
following sequence: 

5' - CAA ATC GAT TGA ACT TGC AGT ATC TCC ACG AC - 3 * 
Clal 

Primer GP 126 has the following sequence: 

20 5" - GGG ftTC GAT CAG ATT CTG TCC CCC AT - 3 • 

Clal 

Primer GP125 has an overlap with exon O (position 654 
to 675 of the Protein C gene) and introduces a Clal site 
in the 5 1 untranslated region. Exon O is the exon not 
25 identified by Foster, £t al. Primer GP126 overlaps the 
region from 1344 to 1315 in the Protein C gene. This 
region contains a Clal site. 

The region between position 654 and 1344 is amplified 
using either human DNA or phage DNA as a template. The 
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so amplified material is digested with Clal and cloned 
in vector pGEN7a to form pPCCC. This vector is 
propagated in a dam negative strain such as GH113 and 
partially cut with Clal (only the plasmids that are cut 
5 once with Clal at position 1340 are of interest) and 
completely with Xbal. The Clal Nhel fragment (fragment 
1) is cloned into this vector. The resultant plasmid 
is designated pPC. Its structure is shown in Fig. 10. 
From this plasmid, the Protein C transgene is isolated 
10 as a Clal-Sall fragment and ligated into pl6Kb, CS (See 
Example 15) to generate a transgene capable of 
expressing Protein C in bovine milk, this plasmid is 
designated pl6 Kb f CS,PC. 

The transgene contained within plasmid p 16 Kb, CS, PC 
15 is excised with NotI and used to generate transgenic 
bovine species as previously described. Such transgenic 
animals are capable of producing protein C in their 
milk. 



Having described the preferred embodiments of the 
20 present invention, it will appear to those ordinarily 
skilled in the art that various modifications may be 
made to the disclosed embodiments, and that such 
modifications are intended to be within the scope of the 
present invention. 

25 All references disclosed herein are expressly 
incorporated by reference. 
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WHftT IS CLAIMED IS; 

1. A transgene for producing a recombinant 
polypeptide in transgenic bovine species comprising at 
least one expression regulation DNA sequence functional 
in at least one cell-type of said bovine species 

5 operably linked to a recombinant DNA encoding a 

recombinant polypeptide, wherein said transgene is 
capable of directing the expression of said recombinant 
DNA sequence in at least said one cell-type of a bovine 
species containing said transgene to produce said 
10 recombinant polypeptide. 

2 . The transgene of Claim 1 wherein said expression 
regulation sequences comprise 5 V and 3' expression 
regulation sequences from a serum albumin, said cell- 
type is liver cell, said recombinant polypeptide is 

15 human serum albumin and said transgene further comprises 

a secretory DNA sequence functional in said liver cells 
and operably linked to the recombinant DNA encoding said 
human serum albumin* 

3. A transgene for producing a recombinant 
20 polypeptide in the milk of transgenic bovine species 

comprising at least one expression regulation DNA 
sequence functional in the mammary secretory cells of 
said bovine species, a secretory DNA sequence encoding 
a secretory signal sequence also functional in the 

25 mammary secretory cells of said bovine species and a 

recombinant DNA sequence encoding a recombinant 
polypeptide, said secretory DNA sequence being operably 
linked to said recombinant DNA sequence and to form a 
secretory-recombinant DNA sequence and said at least one 

30 expression regulation sequence being operably linked to 

said secretory-recombinant DNA sequence, wherein said 
transgene is capable of directing the expression of said 
secretory-recombinant DNA sequence in mammary secretory 
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cells of bovine species containing said transgene to 
produce a form of recombinant polypeptide which when 
secreted from said mammary secretory cells produces 
recombinant polypeptide in he milk of said ftovine 
species. 

4 . The transgene of Claim 1 or 3 further comprising 
a recombinant intervening sequence. 

5 . The transgene of Claim 4 wherein said recombinant 
intervening sequence is a hybrid intervening sequence. 

6. The transgene of Claim 5 wherein said hybrid 
intervening sequence contains a permissive RNA splice 
signal. 

7 • The transgene of Claim 3 wherein said recombinant 
polypeptide is a homologous polypeptide from bovine 
species. 

8. The transgene of Claim 7 wherein said homologous 
polypeptide is selected from the group consisting of 
caseins, lactoferrin, lysozyme, cholesterol hydrolase 
and serum albumin. 

9. The transgene of Claim 3 wherein said recombinant 
polypeptide is a heterologous polypeptide. 

10. The transgene of Claim 9 wherein said 
heterologous polypeptide is selected from the group 
consisting of human milk proteins, human serum proteins, 
and industrial enzymes. 

11. The transgene of Claim 10 wherein said 
heterologous polypeptide is a human milk protein. 
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12 • The transgene of Claim 11 wherein said human milk 
protein is selected from the group consisting of 
secretory immunoglobulins, lysozyme, lactoferrin, 
lactoglobulin, a-lactalbumin and bile salt-stimulated 
5 lipase. 

13. The transgene of Claim 12 wherein said milk 
protein is lactoferrin. 

14 • The transgene of Claim 10 wherein said 
heterologous polypeptide is a human serum protein. 

10 15. The transgene of Claim 14 wherein said human 

serum prqfcein is selected from the group consisting of 
albumin, immunoglobulin. Factor VIII, Factor IX and 
Protein C. 

16. The transgene of Claim 15 wherein said serum 
15 protein is albumin. 

17. The transgene of Claim 10 wherein said 
heterologous polypeptide is an industrial enzyme 
selected from the group consisting of proteases, 
lipases, chitenases and ligninases. 

18. The transgene of Claim 3 wherein said secretory 
DNA sequence is selected from the group consisting of 
DNA sequences encoding secretory signal sequences from 
human lactoferrin, human serum albumin, and secretory 
signal sequences from bovine aSl-casein, aS2 -casein, 0- 
casein, ic-casein, a-lactalbumin, 0-lactoglobulin, and 
serum albumin. 

19 • The transgene of Claim 18 wherein said secretory 
DNA sequence is the DNA sequence encoding the signal 
secretion sequence of bovine aSl casein. 



20 



25 
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20- The transgene of Claim 3 wherein. said at least 
one . expression regulation sequence comprises 5' 
expression regulation DNA sequences, wherein said 5 V 
expression regulation DNA sequence is operably linked 
5 to the 5' end of said secretory-recombinant DNA 

sequence. 



21. The transgene of Claim 20 wherein said 5 9 
expression regulation DNA sequence is selected from the 
group consisting of 5* expression regulation sequence 
10 from bovine genes encoding aSl-casein, aS2 -casein, 

0- casein, k -case in, a-lactalbumin, and 0-lactoglobulin. 



22. The transgene of Claim 21 wherein said 5 9 
expression regulation DNA sequence is a proximal 5 9 
expression regulation sequence comprising the promoter 
15 of bpvjlne aSl-casein. 



23. The transgene of Claim 22 wherein said 5 9 
expression regulation DNA sequence further comprises a 
distal 5 9 expression regulation sequence comprising 
5 9 -flanking DNA sequence from bovine aSl-casein. 



20 24* The transgene of Claim 20 further comprising 3 9 

expression regulation sequences operably linked to the 
3 9 end of said secretory-recombinant DNA sequence. 



25. The transgene of Claim 24 wherein said 3 9 
expression regulation sequence comprise 3 9 expression 
25 regulation sequence from bovine genes encoding aSl- 

casein, aS2 -casein, 0-casein, k-casein, a-lactalbumin, 
and 0-lactoglobulin. 



26. The transgene of Claim 25 wherein said 3 9 
expression regulation DNA sequence comprises a 3 9 
30 proximal expression regulation sequence from bovine 

aSl-casein. 
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27. The transgene of Claim 26 wherein said 3' 
expression regulation DNA sequence further comprises a 
3 V distal expression regulation sequence from bovine 
otSl-casein. 

5 28- The transgene of Claim 27 wherein said distal 

5 9 expression regulation DNA sequence comprises about 
a 30Jcb 5' -flanking region of bovine aSl-casein and said 
distal 3 1 expression regulation DNA sequence comprises 
about a 15kb 3* -flanking region of bovine aSl-casein. 

10 29. A transgenic bovine species capable of producing 

a recombinant polypeptide in at least one cell type of 
said animal, T 

30. A transgenic bovine species capable of producing 
recombinant polypeptide in the milk of said trangenic 

15 species. 

31. The transgenic bovine species of Claim 30 wherein 
said recombinant polypeptide is a homologous polypeptide 
from bovine species. 

32. The transgene bovine species of Claim 30 wherein 
20 said recombinant polypeptide is a heterologous 

polypeptide. 

33 . The transgenic bovine species of Claim 30 wherein 
said heterologous polypeptide is selected from the group 
consisting of human milk proteins, human serum proteins, 

25 and industrial enzymes. 

34. The transgenic bovine species of Claim 33 wherein 
said heterologous polypeptide is a human milk protein. 

35. The transgenic bovine species of Claim 34 wherein 
said human milk protein is selected from the group 



consisting of secretory immunoglobulins , lysozyme, 
lactoferrin, lactoglobulin, a-lactalbumin and bile salt- 
stimulated lipase. 

36 . The transgenic bovine species of Claim 35 wherein 
said milk protein is lactoferrin. 

37 . The transgenic bovine species of Claim 33 wherein 
said heterologous polypeptide is a human serum protein. 

38 . The transgenic bovine species of Claim 37 wherein 
said human serum protein is selected from the group 
consisting of albumin, immunoglobulin, Factor VIII, 
Factor IX and Protein C* 

39* The transgenic bovine species of Claim 38 wherein 
said serum protein is albumin. 

40. The transgenic bovine species of Claim 33 wherein 
said heterologous polypeptide is an industrial enzyme 
selected from the group consisting of proteases, 
lipases, chitinases and ligninases. 

41. Milk from transgenic bovine species containing 
a recombinant polypeptide* 

42. The milk of Claim 41 wherein said recombinant 
polypeptide is a homologous polypeptide from bovine 
species. 

43. The milk of Claim 41 wherein said recombinant 
polypeptide is a heterologous polypeptide. 

44. The milk of Claim 43 wherein said heterologous 
polypeptide is selected from the group consisting of 
human milk proteins, human serum proteins, and 
industrial enzymes. 



45. The milk of Claim 44 wherein said heterologous 
polypeptide is a human milk protein. 

46. The milk of Claim 45 wherein said human milk 
protein is selected from the group consisting of 
secretory immunoglobulins, lysozyme, lactoferrin, 
lactoglobulin, a~lactalbumin and bile salt-stimulated 
lipase. 

47* The milk of Claim 46 wherein said milk protein 
is lactoferrin. 

48. The milk of Claim 43 wherein said heterologous 
polypeptide is a human serum protein. 

49. The milk of Claim 48 wherein said human serum 
protein is selected from the group consisting of 
albumin, immunoglobulin. Factor VIII, Factor IX and 
Protein C. 

50. The milk of Claim 49 wherein said serum protein 
is albumin. 

51. A food formulation comprising transgenic milk 
containing a recombinant polypeptide. 

52* The food formulation of Claim 51 wherein said 
recombinant polypeptide is at least partially purified 
from said transgenic milk. 

53. The food formulation of Claim 31 formulated with 
nutrients appropriate for infant formula. 

54. A method for producing a transgenic bovine 
species capable of producing a recombinant polypeptide 
in the milk of said bovine species, said method 
comprising: 



introducing the transgene of Claim 1 into an 
embryonal target cell of a bovine species; 

transplanting the transgenic embryonal target 
cell formed thereby or the embryo obtained herefrom into 
a recipient female bovine parent; and 

identifying at least one female offspring which 
is capable of producing said recombinant polypeptide in 
the milk of said offspring. 

55. A method for producing a transgenic non-human 
mammal having a desirable phenotype comprising: 

(a) methylating a transgene capable of conferring 
said phenotype when incorporated into the cells of said 
transgenic non-human animal; 

(b) introducing said methylated transgene into 
fertilized oocytes of said non-human mammal to permit 
integration of said transgene into the genomic DNA of 
said fertilized oocytes; 

(c) culturing the individual oocytes formed hereby 
to pre-implantation embryos thereby replicating the 
genome of each of said fertilized oocytes; 

(d) removing at least one cell from each of said pre- 
implantation embryos and lysing said at least one cell 
to release the DNA contained therein; 

(e) contacting said released DNA with a restriction 
endonuclease capable of cleaving said methylated 
transgene but incapable of cleaving the unmethylated 
form of said transgene formed after integration into and 
replication of said genomic DNA; and 

(f) detecting which of said cells from said pre- 
implantation embryos contain a transgene which is 
resistant to cleavage by said restriction endonuclease 
as an indication of which pre-implantation embryos have 
integrated said transgene. 

56. The method of Claim 55 wherein said removal of 
at least one cell forms a first and second hemi -embryo 



for each of said pre- implementation embryos and each of 
said f irst hemi-embryos is lysed and analyzed according 
to steps (d) through (f ) , said method further 
comprising; 

(g) cloning at least one of said second hemi- 
embryos; and 

(h) to form a multiplicity of transgenic 
embryos. 

57. The method of Claim 56 further comprising 
transplanting more than one of said transgenic embryos 
into recipient female parents to produce a population 
containing at least two transgenic non-human animals 
having the same genotype. 

58. The method of Claim 55 further comprising 
transplanting the remainder of said pre-implantation 
embryo containing a genomically integrated transgene 
into a recipient female parent and identifying at least 
one offspring having said phenotype. 

59. The method of claim 55 wherein said restriction 
endonuclease is DPNI and said transgene is methylated 
at N6 of the adenine of the sequence 6ATC contained 
within said transgene. 

60. The method of Claim 59 wherein said detection 
utilizes a polymerase chain reaction using extension 
primers complementary to sequences upstream and 
downstream to said GATC sequence. 

61. The method of Claim 59 wherein said non-human 
transgenic mammal is bovine species, said transgene 
encodes a recombinant polypeptide and said desired 
phenotype is the ability to produce said recombinant 
polypeptide in the milk of said bovine species. 
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62. The method of Claim 61 wherein said transgene 
is the transgene of Claim 3. 
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'As sau961 
haelll 
asul 

5QU961 

nolIV 
hglJII 
ecoD109I 
bspl286 
banll 
asul 
nUIV 

mil aval opal mil 

nboll mil €coD1091 +thmi 

1 GGA CTT GTC TTC CTC GTC CTG CTG TTC C7C GGG GCC CTC GGA CTG 
-18 Gly Leu Val Phe Leu Val Leu Leu Phe Leu Gly Ala Leu Gly Leu 

haelll 

eael hlnPI 
cfrl hhal 
46 TGT CTG GCT GGC CGT AGG AGA AGG AGT GTT CAG TGG TGC GCC GTA TCC 
-3 Cys Leu'Ala Gly Arg Arg Arg Arg Ser Val Gin Trp Cys Ala Val Ser 

haelll 
mil 
aval hael 

94 CAA CCC GAG GCC ACA AAA 7CG TTC CAA TGG CAA AGG AAT ATG AGA AAA 
14 Gin Pro Glu Ale Thr Lys Cys Phe Gin Trp Gin Arg Asn Met Arg Lys 

mil fnu4HI 
squ961 bbvl plel 

haelll alul hlnfl bsri 

asul pvull bsnal fokl 

142 GTG CTG GGC CCT CCT GTC AGC TGC ATA AAG AGA GAC TCC CCC ATC CAG 
30 Val Arg Gly Pro Pro Val Ser Cys lie Lys Arg Asp Ser Pro lie Gin 

haelll 
hael 

scrFl haelll 

ecoRH sau961 

bs-tNI asul sfaNl 

190 TGT ATC CAG GCC ATT GCG GAA AAC AGG GCC GAT GCT GTG ACC CTT GAT 

46 Cys lie Gin Ale lie Ala Glu Asn Arg Ala Asp Ala Val Thr Leu Asp 

sau961 
nlalV 
scrFl 
ecoRll 
bstNl 
haelll 
s-tul haelll 
mil hael asul 
238 GOT GGT TTC ATA TAC GAG GCA GGC CTG GCC CCC TAC AAA CTG CGA CCT 
62 Gly Gly Phe He Tyr Glu Ala Gly Leu Ala Pro Tyr Lys Leu Arg Pro 
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sou961 ' 

avail 

asul 

fnu4Hl accl nlalV 
286 GTA GCG GCG GAA GTC TAG GGG ACC GAA AGA CAG CCA CGA ACT CAC TAT 
78 Val Ala Ala Glu Vol Tyr Gly Thr Glu Arg Gin Pro Arg Thr His Tyr 

fnu4HI 

nboll bbvl alul 

hphl fnu4HI alul pvull 

334 TAT GCC GTG GCT GTG GTG AAG AAG GGC GGC AGC TTT CAG CTG AAC GAA 
94 Tyr Ala Val Ala Val Val Lys Lys Gly Gly Ser Phe Gin Leu Asn Glu 

haelll squ%1 
s-tul avail 
bgll hael asul fokl 

382 CTG CAA GGT CTG AAG TCC TGC CAC ACA GGC CTT CGC AGG ACC GCT GGA 
110 Leu Gin Gly Leu Lys Ser Cys His Thr Gly Leu Arg Arg Thr Ala Gly 

sau%l 
avail 
asul 
nlalV 

430 TG6 ATT GTC CCT ACA GGG ACA CTT CGT CCA TTC TTG AAT TGG ACG GTT 
126 Trp Asn Val Pro Thr Gly Thr Leu Arg Pro Phe Leu Asn Trp Thr Gly 

hglJII alul 
bspl286 fnu4HI 

ban II bbvl ddel alul 

ddel mil pvull nboll pvull 

478 CCA CCT GAG CCC ATT GAG GCA GCT GTG CAG TTC TTC TCA GCC AGC TGT 
142 Pro Pro Glu Pro lie Glu Ala Ala Val Gin Phe Phe Ser Ala Ser Cys 

nspl 

hpall 
scrFI 
ncll 
caul I 

526 GTT CCC GGT GCA GAT AAA GGA CAG TTC CCC AAC CTG TGT CGC CTG TGT 
158 Val Pro Gin Ala Asp Lys Gly Gin Phe Pro Asn Leu Cys Arg Leu Cys 

nlalV 
scrFI 
ecoRII 
mil brtNl rsal 
574 GCG GGG ACA GGG GAA AAC AAA TGT GCC TTC TTC TTC CAG GAA CCG TAC 
174 Ala Gly Thr Gly Glu Asn Lys Cys Ala Phe Ser Ser Gin Glu Pro Tyr 
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nUIV ^ 5 
hglCl 

alul ban I dole I bsml bsmal 

622 TTC A6C TAC TCT GGT 6CC TTC AAG T6T CTG AGA GAC GGG GCT GGA GAC 
190 Phe Ser Tyr Ser Gly Ala Phe Lys Cys Leu Arg Asp Gly Ala Gly Asp 

sau961 
avail 
asul 
ppuMI 

hglAI ecoD109l 
bspl2B6 



mil 


mil 


GAG GAC 


CTG TCA GAC GAG 


G(u Asp 


Leu Ser Asp Glu 


CCA GAC 


AAC ACT CGG AAG 


Pro Asp 


Asn Ser Arg Lys 


scrFl 




ncll 




nspl 




hpall 




caul I 





xnal sau961 
snal nlalV 
scrFl 

ncll avail 
caull 
aval asul 
sau961 ppuMI 
haelll nlalV 

bsrl asul ecoQ1091 n I all I 

766 CCA G7G GAC AAG TTC AAA GAC TGC CAT CTG GCC CGG GTC CCT TCT CAT 
238 Pro Val Asp Lys Phe Lys Asp Cys His Leu Ala Arg Val Pro Ser His 

sfaNI 

fold nboll 
bgll dralll mil hlnfl 

814 GCC 5TT GTG GCA CGA AGT GTG AAT GGC AAG GAG GAT GCC ATC TGG AAT 
254 Ala Val Val Ala Arg Ser Val Asn Gly Lys Glu Asp Ala lie Trp Asn 

scrFl 
ecoRII 

bs-tNI hphl 
862 CTT CTC CGC CAG GCA CAG GAA AAG TTT GGA AAG GAC AAG TCA CCG AAA 
270 Leu Leu Arg Gin Ala Gin Glu Lys Phe Gly Lys Asp Lys Ser Pro Lys 
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%S sau3Al 
nbo] 
dpnl 
xholl 

alul bs-tYI 
bsiXI nlatV fag I II 

910 TTC CAG CTC TTT GGC TCC CCT AGT GGG CAG AAA GAT CTG CTG TTC AAG 
286 Phe Gin Leu Phe Gly Ser Pro Ser Gly Gin Lys Asp Leu Leu Phe Lys 

nlalV 
hglCI 

plel mil bspl286 mil 

hlnfl -taql ban] aval hlnfl 

958 GAC 7CT GCC ATT GGG TTT TCG AGG GTG CCC CCG AGG ATA GAT TCT GGG 
302 Asp Ser Ala He Gly Phe Ser Arg Vol Pro Pro Arg He Asp Ser Gly 

nspl 

sty I hpall 
rsaf ' J ftlalV fokl nnll- 

1006 CTG TAC CTT GGC TCC GGC TAC TTC ACT GCC ATC CAG AAC TTG AGG AAA 
318 Leu Tyr Leu Gly Ser Gly Tyr Phe Thr Ala lie Gin Asn Leu Arg Lys 

nspl 

hpall -thai 
scrFl fnuTII 
ncll bs-tlll 
mil fnu4HI hlnPI 

mil bbvl caul I hhal 

1054 AGT GAG GAG GAA GTG GCT GCC CGG CGT GCG CGG GTC GTG TGG TGT GCG 
344 Ser Glu Glu Glu Val Ala Ala Arg Arg Ala Arg Val Val Trp Cys Ala 

hlnPI 
nstl 
fspl 
fnu4HI 

alul hhal bstXl 
alwNI bbvl brsl 
1102 GTG GGC GAG CAG GAG CTG CGC AAG TGT AAC CAG TGG AGT GGC TTG AGC 
350 Val Gly Glu Gin Glu Leu Arg Lys Cys Asn Gin Trp Ser Gly Leu Ser 

fnu4H] ml] 

bbvl bspMI mil haelll mil sfaNl 

1150 GAA GGC AGC GTG ACC TGC TCC TCG GCC TCC ACC ACA GAG GAC TGC ATC 
366 Glu Gly Ser Val Thr Cys Ser Ser Ala Ser Thr Thr Glu Asp Cys He 

serf] 

ecoRll bs-tXl 
bstNl alul sfaNl nlalll fokl mil 

1198 GCC CTG GTG CTG AAA GGA GAA GCT GAT GCC ATG AGT TTG GAT GGA GGA 

382 Ala Leu Val Leu Lys Gly Glu Ala Asp Ala Me-t Ser Leu Asp Gly Gly 
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nlalll ^ 5 ntalV scrFI 
sphl hglCI ecoRII 

rsal nspCI ban I bs-tNI 

1246 TAT GTG TAC ACT GCA TGC AAA TGT GGT TTG GTG CCT GTC CTG GCA GAG 
398 Tyr Val Tyr Thr Ala Cys Lys Cys Gly Leu Val Pro Val LeU Ala Glu 

sau3Al 
nbol 
dpnl 
alwl 

1294 AAC TAC AAA TCC CAA CAA AGC AGT GAC CCT GAT CCT AAC TGT GTG GAT 
414 Asn Tyr Lys Ser Gin Gin Ser Ser Asp Pro Asp Pro Asn Cys Val Asp 

sau3AI 
nbol 

ecoNI ecoRV dpnl 

1342 AGA CCT GTG GAA GGA TAT CTT GCT GTG GCG GTG GTT AGG AGA TCA GAC 
430 Arg Pro Val ^Glur Gly Tyr Leu Ala Val Ala Val Val Arg Arg Ser Asp 

scrFI 

ecoRII 

bs-tNI 

1390 ACT AGC CTT ACC TGG AAC TCT GTG AAA GGC AAG AAG TCC TGC CAC ACC 
446 Thr Ser Leu Thr Trp Asn Ser Ya! Lys Gly Lys Lys Ser Cys His Thr 

haelll 

nlalll 

sty I sau961 nboll 
pstl ncol asul earl 

1438 GCC GTG GAC AGG ACT GCA GGC TGG AAT ATC CCC ATG GGC CTG CTC TTC 
462 Ala Val Asp Arg Thr Ala Gly TrP Asn lie Pro Me1 Gly Leu Leu Phe 

nlalV 
hgUII 
bspl286 

ban II sspl alul bspl286 

1486 AAC CAG ACG GGC TCC TGC AAA TTT GAT GAA TAT TTC AGT CAA AGC TGT 
478 Asn Gin Thr Gly Ser Cys Lys Phe Asp Glu Tyr Phe Ser Gin Ser Cys 

sau3AI 

nbol 

Dpnl 

scrFI xholl 
ecoRII bs-tYI hglAI 

bs-tNI aval bglll bspl286 

1534 GCC CCT GGG TCT GAC CCG AGA TCT AAT CTC TGT GCT CTG TGT ATT GGC 
484 Ala Pro Gly Ser Asp Pro Arg Ser Asn Leu Cys Ala Leu Cys lie Gly 
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hphl bspl286 
1582 6AC GA6 CAG GGT GAG AAT AAG TGC GTG CCC AAC AGC AAT GAG AGA TAC 
510 Asp Glu Gin Gly Glu Asn Lys Cys Vol Pro Asn Ser Asn Glu Arg Tyr 

nlalV 
hglCl 

ban I scrFl 
nspl ecoRIl 
bsrl hpall bstNl ddel bsnl bsnal 

1630 TAC GGC TAC ACT GGG GCT TTC CGG TGC CTG GCT GAG AAT GCT GGA GAC 
526 Tyr Gly Tyr Thr Gly Ala Phe ArG Cys Leu Ala Glu Asn Ala Gly Asp 

1678 GTT GCA TTT GTG AAA GAT GTC ACT GTC TTG CAG AAC ACT GAT GGA AAT 
542 Val Ala Phe Val Lys Asp Val Thr Val Leu Gin Asn Thr Asp Gly Asn 

fnu4HI 
bbvl 
hlnFl 

nnll nlalll ddel alul hhal 

1726 AAC AAT GAG GCA TGG GCT AAG GAT TTG AAG CTG GCA GAC TTT GCG CTG 
558 Asn Asn Glu Ala Trp Ala Lys Asp Leu Lys Leu Ala Asp Phe Ala Leu 

taql fnu4HI 
nnll nnll bbvl 

bgll ddel alul 

1774 CTG TGC CTC GAT GGC AAA CGG AAG CCT GTG ACT GAG GCT AGA AGC TGC 
574 Leu Cys Leu Asp Gly Lys Arg Lys Pro Val Thr Glu Ala Arg Ser Cys 

sau96l 
nlalV 
nlalll 
sty I haelll 

ncol asul hlnfl nlalll bsnal fokl 

1822 CAT CTT GCC ATG GCC CCG AAT CAT GCC GTG GTG TCT CGG ATG GAT AAG 
590 His Leu Ala Met Ala Pro Asn His Ala Val Val Ser Arg Met Asp Lys 

fnu4HI 
ecoNl alwNI bbvl 
1870 GTG GAA CGC CTG AAA CAG GTG CTG CTC CAC CAA CAG GCT AAA TTT GGG 
606 Val Glu Arg Leu Lys Gin Val Leu Leu His Gin Gin Ala Lys Phe Gly 

sau3Al 

nbol nspl 

dpnl hpall 
xholl scrFI 
bstYl ncll 

alwl caull bsrl 

1919 AGA AAT GGA TCT GAC TGC CCG GAC AAG TTT TGC TTA TTC CAG TCT GAA 

622 Arg Asn Gly Ser Asp Cys Pro Asp Lys Phe Cys Leu Phe Gin Ser Glu 
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haelll 
hael 

eael siyl 
ddel cfrl plel ncol 

dralll ball hlnfl 

1966 ACC AAA AAC CTT CTG 7TC ATT GAC AAC ACT GAG TGT CTG GCC AGA CTC 
638 Thr Lys Asn Leu Leu Phe Asn Asp Asn Thr Glu Cys Leu Ala Arg Leu 

sau961 

avail 

asul 

nlalll ndel sspl nlalV 

HOI 4 CAT GGC AAA ACA ACA TAT GAA AAA TAT TTG GGA CCA CAG TAT GTC GCA 
654 His Gly Lys Thr Thr Tyr Glu Lys Tyr Leu Gly Pro Gin Tyr Val Ala 

scrFl 
ecoRll 

hglAl tasiNl 
bsplH86 nnll nnll 
2062 GGC ATT ACT AAT CGT AAA AAG TGC TCA ACC TCC CCC CTC CTG GAA GCC 
670 Gly He Thr Asn Leu Lys Lys Cys Ser Thr Ser Pro Leu Leu Glu Ala 

ddel 
ns-tll 

nnll sau961 
ecoBH nboll hael 11 

ecoRl bsu361 nboll asul alul 

2110 TGT GAA TTC CTC AGG AAG TAA AACCGAAGAA GATGGCCCAG CTCCCCAAGA 
685 Cys Glu Phe Leu Arg Lys DCi 

s-tyl 
hael II 
sau961 

nboll scrFI asul 
ddel earl ecoRII nlalV 

nnll alul bsiNI eco01D91 nlalV 

2161 AAGCCTCAGC CATTCACTGC CCCCAGCTCT TCTCCCCAGG TGTGTTGGGG CCTTGGCTCC 

ecoNl fokl ddel 

2221 CCTGCTGAAG GTGGGGATTG CCCATCCATC TGCTTACAAT TCCCTGCTGT CGTCTTAGCA 

2281 AGAAGTAAAA TGAGAAATTT TGTTGATATT CAAAAAAAA 

>LENGTH« 2319 
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Shown below Is -the DNA sequence of "the hybrid splice. The 
predicted Intervening sequence (shown In lower case) 
consists of the 5'-end of 1VS-1 from bovine aSl casein 
(from position +54 to +180 with respect to the start 
of transcription) fused to the 3'-end of a human IgG 
splice sequence. The Hlndlll site Cm bold type and 
underlined) derives fron the IgG sequence and narks 
the Junction between the aSl and IgG splice sequences. 
The 5'-end upper case sequence depicts the conplete 
exon one of the bovine aSl casein gene. The 3'-end upper 
case sequence represents the splice Junction of the IgG 
gene through to the PstI site (CTGCAG) incorporated in 
the cloning vector, pMHl. 



5'- ATCACCTTGA TCATCAACCC AGCTTGCTGC TTCTTCCCAG 

TCTTGGGTTA AAG giattaigia tacataiaac aaarittrta tgattttcct ctgtrtartc 
tHc&ttctt cartatrtacg wffttgiaac ttttrtatgt gattgcaagt attggtartt tcrtaigirta 
lartgttagc aagcttgagg tgtggtaggc ttgagatctg gccaiacact Igagtgacaa Igacaiccac 

•mgcctttc trtcccacag GTGTCCACTC CCAGGTCCAA CTGCAG -3' 
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